Introduction

The discipline of game theory was pioneered in the early 20th century by
mathematicians Ernst Zermelo (1913) and John von Neumann (1928).
The breakthrough came with John von Neumann and Oskar
Morgenstern’s book, Theory of Games and Economic Behavior, published
in 1944. This was followed by important work by John Nash (1950-51)
and  Lloyd  Shapley  (1953).  Game  theory  had  a major  influence  on  the 
development  of  several  branches  of  economics  (industrial  organization, 
international  trade,  labor  economics,  macroeconomics,  etc.).  Over  time  the 
impact  of  game  theory  extended  to  other  branches  of  the  social  sciences 
(political  science,  international  relations,  philosophy,  sociology,  anthropology, 
etc.)  as  well  as  to  fields  outside  the  social  sciences,  such  as  biology,  computer 
science, logic, etc. In 1994 the Nobel prize in economics was given to three
game theorists, John Nash, John Harsanyi and Reinhard Selten, for their
theoretical work in game theory which was very influential in economics. At
the same time, the US Federal Communications Commission was using game
theory to help it design a $7-billion auction of the radio spectrum for personal
communication services (naturally, the bidders used game theory too!). The
Nobel prize in economics was awarded to game theorists three more times: in
2005 to Robert Aumann and Thomas Schelling, in 2007 to Leonid Hurwicz,
Eric Maskin and Roger Myerson and in 2012 to Lloyd Shapley and Alvin Roth.

Game  theory  provides  a formal  language  for  the  representation  and  analysis  of 
interactive  situations,  that  is,  situations  where  several  “entities”,  called  players, 
take  actions  that  affect  each  other.  The  nature  of  the  players  varies  depending 
on  the  context  in  which  the  game  theoretic  language  is  invoked:  in 
evolutionary biology (see, for example, John Maynard Smith, 1982) players are
non-thinking living organisms; in computer science (see, for example, Shoham and
Leyton-Brown, 2008) players are artificial agents; in behavioral game theory
(see,  for  example,  Camerer,  2003)  players  are  “ordinary”  human  beings,  etc. 
Traditionally,  however,  game  theory  has  focused  on  interaction  among 
intelligent,  sophisticated  and  rational  individuals.  For  example,  Aumann 
describes  game  theory  as  follows: 

“Briefly  put,  game  and  economic  theory  are  concerned  with  the 
interactive  behavior  of  Homo  rationalis  - rational  man.  Homo 
rationalis  is  the  species  that  always  acts  both  purposefully  and 
logically,  has  well-defined  goals,  is  motivated  solely  by  the  desire 
to  approach  these  goals  as  closely  as  possible,  and  has  the 
calculating  ability  required  to  do  so.”  (Aumann,  1985,  p.  35.) 

This  book  is  concerned  with  the  traditional  interpretation  of  game  theory. 

Game  theory  is  divided  into  two  main  branches.  The  first  is  cooperative  game 
theory,  which  assumes  that  the  players  can  communicate,  form  coalitions  and 
sign  binding  agreements.  Cooperative  game  theory  has  been  used,  for  example, 
to  analyze  voting  behavior  and  other  issues  in  political  science  and  related 
fields.  We  will  deal  exclusively  with  the  other  main  branch,  namely  non- 
cooperative  game  theory.  Non-cooperative  game  theory  models  situations 
where  the  players  are  either  unable  to  communicate  or  are  able  to  communicate 
but  cannot  sign  binding  contracts.  An  example  of  the  latter  situation  is  the 
interaction  among  firms  in  an  industry  in  an  environment  where  antitrust  laws 
make  it  illegal  for  firms  to  reach  agreements  concerning  prices  or  production 
quotas  or  other  forms  of  collusive  behavior. 

The book is divided into five parts.

Part  I deals  with  games  with  ordinal  payoffs,  that  is,  with  games  where  the 
players’  preferences  over  the  possible  outcomes  are  only  specified  in  terms  of 
an  ordinal  ranking  (outcome  o is  better  than  outcome  o’  or  o is  just  as  good  as 
o').  Chapter  1 covers  strategic-form  games,  Chapter  2 deals  with  dynamic 
games  with  perfect  information  and  Chapter  3 with  dynamic  games  with 
(possibly)  imperfect  information. 


Evolutionary game theory has been applied not only to the analysis of animal and insect
behavior but also to studying the “most successful strategies” for tumor and cancer cells (see,
for example, Gerstung et al., 2011).

Part II is devoted to games with cardinal payoffs, that is, with games where the
players’  preferences  extend  to  uncertain  prospects  or  lotteries:  players  are 
assumed  to  have  a consistent  ranking  of  the  set  of  lotteries  over  basic  outcomes. 
Chapter  4 reviews  the  theory  of  expected  utility,  Chapter  5 discusses  the  notion 
of  mixed  strategy  in  strategic-form  games  and  of  mixed-strategy  Nash 
equilibrium, while Chapter 6 deals with mixed strategies in dynamic games.

Parts  III,  IV  and  V cover  a number  of  advanced  topics. 

Part  III  deals  with  the  notions  of  knowledge,  common  knowledge  and  belief. 
Chapter  7 explains  how  to  model  what  an  individual  knows  and  what  she  is 
uncertain  about  and  how  to  extend  the  analysis  to  the  interactive  knowledge  of 
several  individuals  (e.g.  what  Individual  1 knows  about  what  Individual  2 
knows  about  some  facts  or  about  the  state  of  knowledge  of  Individual  1).  The 
chapter  ends  with  the  notion  of  common  knowledge.  Chapter  8 adds 
probabilistic  beliefs  to  the  knowledge  structures  of  the  previous  chapter  and 
discusses  the  notions  of  Bayesian  updating,  belief  revision,  like-mindedness  and 
the  possibility  of  “agreeing  to  disagree”.  Chapter  9 uses  the  interactive 
knowledge-belief  structures  of  the  previous  two  chapters  to  model  the  players’ 
state  of  mind  in  a possible  play  of  a given  game  and  studies  the  implications  of 
common  knowledge  of  rationality  in  strategic-form  games. 

Part  IV  focuses  on  dynamic  (or  extensive-form)  games  and  on  the  issue  of  how 
to  refine  the  notion  of  subgame-perfect  equilibrium  (which  was  introduced  in 
Chapters  3 and  6).  Chapter  10  introduces  a simple  notion,  called  weak 
sequential  equilibrium,  which  achieves  some  desirable  goals  (such  as  the 
elimination  of  strictly  dominated  choices)  but  fails  to  provide  a refinement  of 
subgame-perfect  equilibrium.  Chapter  11  explains  the  more  complex  notion  of 
sequential  equilibrium,  which  is  extensively  used  in  applications  of  game 
theory. That notion, however, leaves something to be desired from a practical point of
view (it is typically hard to show that an equilibrium is indeed a sequential
equilibrium) and also from a conceptual point of view (it appeals to a
topological condition, whose interpretation is not clear). Chapter 12
introduces an intermediate notion, called perfect Bayesian equilibrium, whose
conceptual justification is anchored in the so-called AGM theory of belief
revision, extensively studied in philosophy and computer science, which was
pioneered by Carlos Alchourrón (a legal scholar), Peter Gärdenfors (a
philosopher) and David Makinson (a computer scientist) in 1985. In Chapter 12 we
also  provide  an  alternative  characterization  of  sequential  equilibrium  based  on  the 
notion  of  perfect  Bayesian  equilibrium,  which  is  free  of  topological  conditions. 

Part V deals with the so-called “theory of games of incomplete information”,
which  was  pioneered  by  John  Harsanyi  (1967-68).  This  theory  is  usually 
explained  using  the  so-called  “type-space”  approach  suggested  by  Harsanyi. 
However,  we  follow  a different  approach:  the  so-called  “state-space”  approach, 
which  makes  use  of  the  interactive  knowledge-belief  structures  developed  in 
Part  III.  We  find  this  approach  both  simpler  and  more  elegant.  For 
completeness,  in  Chapter  15  we  explain  the  commonly  used  type-based 
structures  and  show  how  to  convert  a state-space  structure  into  a type-space 
structure  and  vice  versa  (the  two  approaches  are  equivalent).  Chapter  13  deals 
with  situations  of  incomplete  information  that  involve  static  (or  strategic-form) 
games,  while  Chapter  14  deals  with  situations  of  incomplete  information  that 
involve dynamic (or extensive-form) games.

Appendix  E (‘E’  for  ‘exercises’)  at  the  end  of  each  chapter  contains  several 
exercises, which are divided according to that chapter’s sections. Indeed, at the
end  of  each  chapter  section  the  reader  is  invited  to  go  to  the  appendix  and  try 
the  exercises  for  that  section.  Complete  and  detailed  answers  for  each  exercise 
can  be  found  in  Appendix  S (‘S’  for  ‘solutions’)  at  the  end  of  each  chapter.  For 
each chapter, the set of exercises culminates in a “challenging question”,
which is more difficult and more time-consuming than the other exercises. In
game  theory,  as  in  mathematics  in  general,  it  is  essential  to  test  one’s 
understanding  of  the  material  by  attempting  to  solve  exercises  and  problems. 
The  reader  is  encouraged  to  attempt  solving  exercises  after  the  introduction  of 
every  new  concept. 

Chapter 1

Ordinal games in strategic form


1.1.  Game  frames  and  games 

Game theory deals with interactive situations where two or more
individuals, called players, make decisions that jointly determine the final
outcome. To see an example, point your browser to the following video:

https://www.youtube.com/watch?v=tBtr8-VMjOE

(if  you  search  for  ‘Split  or  Steal’  on  youtube.com  you  will  find  several  instances  of 
this  game).  The  video  shows  an  excerpt  from  Golden  Balls,  a British  daytime  TV 
game show. In it each of two players, Sarah and Steven, has to pick one of two balls:
inside  one  ball  appears  the  word  ‘split’  and  inside  the  other  the  word  ‘steal’  (each 
player  is  first  asked  to  secretly  check  which  of  the  two  balls  in  front  of  him/her  is 
the  split  ball  and  which  is  the  steal  ball).  They  make  their  decisions  simultaneously. 
The  possible  outcomes  are  shown  in  Table  1.1,  where  each  row  is  labeled  with  a 
possible  choice  for  Sarah  and  each  column  with  a possible  choice  for  Steven.  Each 
cell  in  the  table  thus  corresponds  to  a possible  pair  of  choices  and  the  resulting 
outcome  is  written  inside  the  cell. 


                                Steven
                  Split                    Steal

Sarah   Split     Sarah gets $50,000       Sarah gets nothing
                  Steven gets $50,000      Steven gets $100,000

        Steal     Sarah gets $100,000      Sarah gets nothing
                  Steven gets nothing      Steven gets nothing

Table 1.1

What should a rational player do in such a situation? It is tempting to reason as
follows. 


Let  us  focus  on  Sarah’s  decision  problem.  She  realizes  that 
her  decision  alone  is  not  sufficient  to  determine  the  outcome; 
she  has  no  control  over  what  Steven  will  choose  to  do. 

However,  she  can  envision  two  scenarios:  one  where  Steven 
chooses  Steal  and  the  other  where  he  chooses  Split. 

• If  Steven  decides  to  Steal,  then  it  does  not  matter  what 
Sarah  does,  because  she  ends  up  with  nothing,  no  matter 
what  she  chooses. 

• If Steven picks Split, then Sarah will get either $50,000 (if
she also picks Split) or $100,000 (if she picks Steal).

Thus  Sarah  should  choose  Steal. 

The  trouble  with  the  above  argument  is  that  it  is  not  a valid  argument  because  it 
makes  an  implicit  assumption  about  how  Sarah  ranks  the  outcomes,  namely  that 
she  is  selfish  and  greedy,  which  may  or  may  not  be  correct.  Let  us  denote  the 
outcomes  as  follows: 


\(o_1\): Sarah gets $50,000 and Steven gets $50,000.
\(o_2\): Sarah gets nothing and Steven gets $100,000.
\(o_3\): Sarah gets $100,000 and Steven gets nothing.
\(o_4\): Sarah gets nothing and Steven gets nothing.

Table  1.2 

If, indeed, Sarah is selfish and greedy - in the sense that, in evaluating the outcomes,
she focuses exclusively on what she herself gets and prefers more money to
less - then her ranking of the outcomes is as follows: \(o_3 \succ o_1 \succ o_2 \sim o_4\) (which reads
‘\(o_3\) is better than \(o_1\), \(o_1\) is better than \(o_2\) and \(o_2\) is just as good as \(o_4\)’). But there
are other possibilities. For example, Sarah might be fair-minded and view the
outcome where both get $50,000 as better than all the other outcomes. For
instance, her ranking could be \(o_1 \succ o_3 \succ o_2 \succ o_4\); according to this ranking, besides
valuing fairness, she also displays benevolence towards Steven, in the sense that -
when comparing the two outcomes where she gets nothing, namely \(o_2\) and \(o_4\) -
she prefers the one where at least Steven goes home with some money. If, in fact,
Sarah is fair-minded and benevolent, then the logic underlying the above argument
would yield the opposite conclusion, namely that she should choose Split.

Thus we cannot presume to know the answer to the question “What is the rational
choice  for  Sarah?”  if  we  don’t  know  what  her  preferences  are.  It  is  a common 
mistake  (unfortunately  one  that  even  game  theorists  sometimes  make)  to  reason 
under  the  assumption  that  players  are  selfish  and  greedy.  This  is,  typically,  an 
unwarranted  assumption.  Research  in  experimental  psychology,  philosophy  and 
economics  has  amply  demonstrated  that  many  people  are  strongly  motivated  by 
considerations of fairness. Indeed, fairness seems to motivate not only humans but
also other primates, as shown in the following video:

http://www.ted.com/talks/frans_de_waal_do_animals_have_morals

The  situation  illustrated  in  Table  1.1  is  not  a game  as  we  have  no  information  about 
the  preferences  of  the  players;  we  use  the  expression  game-frame  to  refer  to  it.  In 
the  case  where  there  are  only  two  players  and  each  player  has  a small  number  of 
possible  choices  (also  called  strategies),  a game-frame  can  be  represented  - as  we  did 
in  Table  1.1  - by  means  of  a table,  with  as  many  rows  as  the  number  of  possible 
strategies  of  Player  1 and  as  many  columns  as  the  number  of  strategies  of  Player  2; 
each  row  is  labeled  with  one  strategy  of  Player  1 and  each  column  with  one  strategy 
of  Player  2;  inside  each  cell  of  the  table  (which  corresponds  to  a pair  of  strategies, 
one  for  Player  1 and  one  for  Player  2)  we  write  the  corresponding  outcome. 

Before  presenting  the  definition  of  game-frame,  we  remind  the  reader  of  what  the 
Cartesian product of two or more sets is. Let \(S_1\) and \(S_2\) be two sets. Then the
Cartesian product of \(S_1\) and \(S_2\), denoted by \(S_1 \times S_2\), is the set of ordered pairs
\((x_1, x_2)\) where \(x_1\) is an element of \(S_1\) (\(x_1 \in S_1\)) and \(x_2\) is an element of \(S_2\)
(\(x_2 \in S_2\)). For example, if \(S_1 = \{a, b, c\}\) and \(S_2 = \{D, E\}\) then

\[ S_1 \times S_2 = \{(a,D),\ (a,E),\ (b,D),\ (b,E),\ (c,D),\ (c,E)\}. \]

The definition extends to the general case of n sets (\(n \geq 2\)): an element of
\(S_1 \times S_2 \times \dots \times S_n\) is an ordered n-tuple \((x_1, x_2, \dots, x_n)\) where, for each
\(i = 1, \dots, n\), \(x_i \in S_i\).
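
As a quick illustration, the example above can be reproduced with a few lines of Python. This is only a sketch: it uses the standard-library function `itertools.product`, and the third set `S3` is a hypothetical addition, introduced purely to show the n-set case.

```python
from itertools import product

S1 = ["a", "b", "c"]
S2 = ["D", "E"]

# product(S1, S2) yields the ordered pairs (x1, x2) with x1 in S1 and x2 in S2.
pairs = list(product(S1, S2))
print(pairs)

# The same call handles n sets: each element is then an ordered n-tuple.
S3 = ["f", "g"]  # hypothetical third set, for illustration only
triples = list(product(S1, S2, S3))
print(len(triples))  # 3 * 2 * 2 = 12
```

The number of elements of the product is the product of the sizes of the individual sets, which is why the last line prints 12.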

The  definition  of  game-frame  is  as  follows. 


1 Also available at https://www.youtube.com/watch?v=GcJxRqTs5nk

Definition 1.1. A game-frame in strategic form is a list of four items (a quadruple)
\(\langle I, (S_1, S_2, \dots, S_n), O, f \rangle\) where:

• \(I = \{1, 2, \dots, n\}\) is a set of players (\(n \geq 2\)).

• \((S_1, S_2, \dots, S_n)\) is a list of sets, one for each player. For every Player \(i \in I\), \(S_i\) is
the set of strategies (or possible choices) of Player i. We denote by S the Cartesian
product of these sets: \(S = S_1 \times S_2 \times \dots \times S_n\); thus an element of S is a list
\(s = (s_1, s_2, \dots, s_n)\) consisting of one strategy for each player. We call S the set of
strategy profiles.

• O is a set of outcomes.

• \(f : S \to O\) is a function that associates with every strategy profile s an outcome
\(f(s) \in O\).

Using  the  notation  of  Definition  1.1,  the  situation  illustrated  in  Table  1.1  is  the 
following  game-frame  in  strategic  form: 

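• \(I = \{1, 2\}\) (letting Sarah be Player 1 and Steven Player 2),

• \((S_1, S_2) = (\{\text{Split}, \text{Steal}\}, \{\text{Split}, \text{Steal}\})\); thus \(S_1 = S_2 = \{\text{Split}, \text{Steal}\}\), so that the
set of strategy profiles is \(S = \{(\text{Split}, \text{Split}), (\text{Split}, \text{Steal}), (\text{Steal}, \text{Split}), (\text{Steal}, \text{Steal})\}\),

• O is the set of outcomes listed in Table 1.2,

• f is the following function:

\[
\begin{array}{l|cccc}
s & (\text{Split}, \text{Split}) & (\text{Split}, \text{Steal}) & (\text{Steal}, \text{Split}) & (\text{Steal}, \text{Steal}) \\ \hline
f(s) & o_1 & o_2 & o_3 & o_4
\end{array}
\]

(that is, \(f((\text{Split}, \text{Split})) = o_1\), \(f((\text{Split}, \text{Steal})) = o_2\), etc.).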
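
To make the four components concrete, here is one possible way of writing this game-frame down in Python. It is a minimal sketch, not a prescribed encoding: the variable names (`players`, `strategies`, `profiles`, `outcomes`, `f`) and the use of dictionaries are assumptions made for illustration.

```python
from itertools import product

# I: the set of players (Sarah is Player 1, Steven is Player 2).
players = [1, 2]

# (S_1, S_2): one strategy set per player.
strategies = {1: ["Split", "Steal"], 2: ["Split", "Steal"]}

# S: the set of strategy profiles, i.e. the Cartesian product S_1 x S_2.
profiles = list(product(*(strategies[i] for i in players)))

# O: the outcomes of Table 1.2.
outcomes = {
    "o1": "Sarah gets $50,000 and Steven gets $50,000",
    "o2": "Sarah gets nothing and Steven gets $100,000",
    "o3": "Sarah gets $100,000 and Steven gets nothing",
    "o4": "Sarah gets nothing and Steven gets nothing",
}

# f: associates an outcome with every strategy profile.
f = {
    ("Split", "Split"): "o1",
    ("Split", "Steal"): "o2",
    ("Steal", "Split"): "o3",
    ("Steal", "Steal"): "o4",
}

# Sanity checks: f is defined on all of S and takes its values in O.
assert set(f) == set(profiles)
assert set(f.values()) <= set(outcomes)

print(outcomes[f[("Steal", "Split")]])
```

Note that nothing in this structure says how the players rank the outcomes; that is exactly what distinguishes a game-frame from a game.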

From  a game-frame  one  obtains  a game  by  adding,  for  each  player,  her  preferences 
over  (or  ranking  of)  the  possible  outcomes.  We  use  the  following  notation. 

Notation               Meaning

\(o \succ_i o'\)       Player i considers outcome o to be better than outcome o'

\(o \sim_i o'\)        Player i considers o to be just as good as o'
                       (that is, Player i is indifferent between o and o')

\(o \succsim_i o'\)    Player i considers o to be at least as good as o'
                       (that is, either better than or just as good as)

Table 1.3

For example, if M denotes ‘Mexican food’ and J ‘Japanese food’, \(M \succ_{\text{Alice}} J\) means
that Alice prefers Mexican food to Japanese food and \(M \sim_{\text{Bob}} J\) means that Bob is
indifferent between the two.

Remark 1.1. The “at least as good” relation \(\succsim\) is sufficient to capture also strict
preference \(\succ\) and indifference \(\sim\). In fact, starting from \(\succsim\), one can define strict
preference as follows: \(o \succ_i o'\) if and only if \(o \succsim_i o'\) and \(o' \not\succsim_i o\), and one can define
indifference as follows: \(o \sim_i o'\) if and only if \(o \succsim_i o'\) and \(o' \succsim_i o\).

We will assume throughout this book that the “at least as good” relation \(\succsim_i\) of
Player i - which embodies her preferences over (or ranking of) the outcomes - is
complete (for every two outcomes \(o_1\) and \(o_2\), either \(o_1 \succsim_i o_2\) or \(o_2 \succsim_i o_1\)) and
transitive (if \(o_1 \succsim_i o_2\) and \(o_2 \succsim_i o_3\) then \(o_1 \succsim_i o_3\)).2
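
The two assumptions, together with the derived relations of Remark 1.1, can be checked mechanically on a small example. The sketch below encodes an “at least as good” relation as a set of ordered pairs, using Sarah's “selfish and greedy” ranking from the Golden Balls example; the `tier` dictionary and all function names are illustrative assumptions, not notation used in the book.

```python
from itertools import product

outcomes = ["o1", "o2", "o3", "o4"]

# Sarah's "selfish and greedy" ranking, encoded by tiers (lower tier = better):
# o3 is best, then o1, then o2 and o4 tied at the bottom.
tier = {"o3": 0, "o1": 1, "o2": 2, "o4": 2}

# weak contains the pair (x, y) exactly when x is at least as good as y.
weak = {(x, y) for x, y in product(outcomes, repeat=2) if tier[x] <= tier[y]}

def strictly_better(x, y, rel=weak):
    """Strict preference derived from the weak relation (Remark 1.1)."""
    return (x, y) in rel and (y, x) not in rel

def indifferent(x, y, rel=weak):
    """Indifference derived from the weak relation (Remark 1.1)."""
    return (x, y) in rel and (y, x) in rel

def is_complete(rel, items):
    """Every two items are comparable in at least one direction."""
    return all((x, y) in rel or (y, x) in rel
               for x, y in product(items, repeat=2))

def is_transitive(rel, items):
    """Whenever x >= y and y >= z hold, x >= z holds as well."""
    return all((x, z) in rel
               for x, y, z in product(items, repeat=3)
               if (x, y) in rel and (y, z) in rel)

print(is_complete(weak, outcomes), is_transitive(weak, outcomes))

# Removing the comparison between o2 and o4 breaks completeness.
broken = weak - {("o2", "o4"), ("o4", "o2")}
print(is_complete(broken, outcomes))
```

Building the relation from tiers guarantees both properties by construction; the `broken` relation is included only to show what a failure of completeness looks like.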

There  are  (at  least)  three  ways  of  representing,  or  expressing,  a complete  and 
transitive  ranking  of  a set  of  outcomes  O.  For  example,  suppose  that 
\(O = \{o_1, o_2, o_3, o_4, o_5\}\) and that we want to represent the following ranking: \(o_3\) is
the best outcome, it is better than \(o_5\), which is just as good as \(o_1\), \(o_1\) is better than
\(o_4\), which, in turn, is better than \(o_2\) (so that \(o_2\) is the worst outcome). We can
represent this ranking in one of the following ways.

1. By using the notation of Table 1.3: \(o_3 \succ o_5 \sim o_1 \succ o_4 \succ o_2\).

2.  By  listing  the  outcomes  in  a column,  starting  with  the  best  at  the  top  and 
proceeding  down  to  the  worst,  thus  using  the  convention  that  if  outcome  o is 
listed  above  outcome  o'  then  o is  preferred  to  o' , while  if  o and  o'  are 
written  next  to  each  other,  then  they  are  considered  to  be  just  as  good: 

best     \(o_3\)
         \(o_1, o_5\)
         \(o_4\)
worst    \(o_2\)


2 Transitivity of the “at least as good” relation \(\succsim\) implies transitivity of the indifference relation
(if \(o_1 \sim o_2\) and \(o_2 \sim o_3\) then \(o_1 \sim o_3\)) as well as transitivity of the strict preference relation (not
only in the sense that (1) if \(o_1 \succ o_2\) and \(o_2 \succ o_3\) then \(o_1 \succ o_3\), but also that (2) if \(o_1 \succ o_2\) and
\(o_2 \sim o_3\) then \(o_1 \succ o_3\) and (3) if \(o_1 \sim o_2\) and \(o_2 \succ o_3\) then \(o_1 \succ o_3\)).

3. By assigning a number to each outcome, with the convention that if the
number assigned to o is greater than the number assigned to o' then o is preferred
to o', and if two outcomes are assigned the same number then they are
considered to be just as good. For example, we could choose the following
numbers:

\[
\begin{array}{l|ccccc}
\text{outcome} & o_1 & o_2 & o_3 & o_4 & o_5 \\ \hline
\text{number} & 6 & 1 & 8 & 2 & 6
\end{array}
\]

Such an assignment of numbers is called a
utility function. A useful way of thinking of utility is as an “index of
satisfaction”: the higher the index the better the outcome; however, this
suggestion is just to aid memory and should be taken with a grain of salt,
because a utility function does not measure anything and, furthermore, as
explained below, the actual numbers used as utility indices are completely
arbitrary.3
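
The third representation can be converted back into the second (the column listing from best to worst) by sorting and grouping. The following sketch assumes the utilities 6, 1, 8, 2, 6 for o1, ..., o5, as in the example above; the function name `tiers` is hypothetical.

```python
from itertools import groupby

# Utilities from the example: o3 is best (8), o2 is worst (1).
utility = {"o1": 6, "o2": 1, "o3": 8, "o4": 2, "o5": 6}

def tiers(u):
    """Group outcomes by utility, best tier first; ties share a tier."""
    # Sort by decreasing utility (ties broken by name, for a stable listing).
    ordered = sorted(u, key=lambda o: (-u[o], o))
    # groupby merges consecutive outcomes that have the same utility.
    return [list(group) for _, group in groupby(ordered, key=lambda o: u[o])]

for level in tiers(utility):
    print(", ".join(level))
```

Each printed line corresponds to one row of the column listing: outcomes on the same line are just as good, and earlier lines are better than later ones.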


Definition 1.2. Given a complete and transitive ranking \(\succsim\) of a finite set of
outcomes O, a function \(U : O \to \mathbb{R}\) (where \(\mathbb{R}\) denotes the set of real numbers) is
said to be an ordinal utility function that represents the ranking if, for every two
outcomes o and o', \(U(o) > U(o')\) if and only if \(o \succ o'\) and \(U(o) = U(o')\) if and
only if \(o \sim o'\). The number \(U(o)\) is called the utility of outcome o.


Remark  1.2.  Note  that  the  statement  “for  Alice  the  utility  of  Mexican  food  is  10”  is 
in  itself  a meaningless  statement;  on  the  other  hand,  what  would  be  a meaningful 
statement  is  “for  Alice  the  utility  of  Mexican  food  is  10  and  the  utility  of  Japanese 
food  is  5”,  because  such  a statement  conveys  the  information  that  she  prefers 
Mexican  food  to  Japanese  food.  However,  the  two  numbers  10  and  5 have  no  other 
meaning  besides  the  fact  that  10  is  greater  than  5:  for  example  we  cannot,  and 
should  not,  infer  from  these  numbers  that  she  considers  Mexican  food  twice  as 
good  as  Japanese  food.  The  reason  for  this  is  that  we  could  have  expressed  the  same 
fact,  namely  that  she  prefers  Mexican  food  to  Japanese  food,  by  assigning  utility  100 
to  Mexican  food  and  -25  to  Japanese  food,  or  with  any  other  two  numbers. 


3 Note that assigning a utility of 1 to an outcome o does not mean that o is the “first choice”.
Indeed, in this example a utility of 1 is assigned to the worst outcome: \(o_2\) is the worst outcome
because it has the lowest utility (which happens to be 1, in this example).

It follows from the above remark that there is an infinite number of utility
functions that represent the same ranking. For instance, the following are
equivalent ways of representing the ranking \(o_3 \succ o_1 \succ o_2 \sim o_4\) (f, g and h are three
out of the many possible utility functions):

\[
\begin{array}{l|cccc}
\text{utility function} \downarrow \quad \text{outcome} \rightarrow & o_1 & o_2 & o_3 & o_4 \\ \hline
f & 5 & 2 & 10 & 2 \\
g & 0.8 & 0.7 & 1 & 0.7 \\
h & 27 & 1 & 100 & 1
\end{array}
\]
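
One way to convince oneself that f, g and h carry exactly the same information is to compare how they order every pair of outcomes. The check below is a sketch under that reading of Definition 1.2; `same_ranking` and `sign` are illustrative names, and the function `k` is a hypothetical counterexample added only for contrast.

```python
from itertools import product

f = {"o1": 5, "o2": 2, "o3": 10, "o4": 2}
g = {"o1": 0.8, "o2": 0.7, "o3": 1, "o4": 0.7}
h = {"o1": 27, "o2": 1, "o3": 100, "o4": 1}

def sign(x):
    """Return 1, 0 or -1 according to the sign of x."""
    return (x > 0) - (x < 0)

def same_ranking(u, v):
    """True if u and v order every pair of outcomes in the same way."""
    return all(sign(u[a] - u[b]) == sign(v[a] - v[b])
               for a, b in product(u, repeat=2))

print(same_ranking(f, g), same_ranking(g, h))

# A hypothetical function that ranks o2 above o4 represents a different ranking.
k = {"o1": 5, "o2": 3, "o3": 10, "o4": 2}
print(same_ranking(f, k))
```

Only the signs of the utility differences are compared, never their sizes, which reflects the point of Remark 1.2 that the numbers themselves carry no meaning beyond their order.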

Utility  functions  are  a particularly  convenient  way  of  representing  preferences.  In 
fact,  by  using  utility  functions  one  can  give  a more  condensed  representation  of 
games,  as  explained  in  the  last  paragraph  of  the  following  definition. 

Definition 1.3. An ordinal game in strategic form is a quintuple
\(\langle I, (S_1, \dots, S_n), O, f, (\succsim_1, \dots, \succsim_n) \rangle\) where:

• \(\langle I, (S_1, \dots, S_n), O, f \rangle\) is a game-frame in strategic form (Definition 1.1) and

• for every Player \(i \in I\), \(\succsim_i\) is a complete and transitive ranking of the set of
outcomes O.

If we replace each ranking \(\succsim_i\) with a utility function \(U_i\) that represents it, and we
assign, to each strategy profile s, Player i’s utility of \(f(s)\) (recall that \(f(s)\) is the
outcome associated with s) then we obtain a function \(\pi_i : S \to \mathbb{R}\) called Player i’s
payoff function.4 Thus \(\pi_i(s) = U_i(f(s))\). Having done so, we obtain a triple
\(\langle I, (S_1, \dots, S_n), (\pi_1, \dots, \pi_n) \rangle\) called a reduced-form ordinal game in strategic form
(‘reduced-form’ because some information is lost, namely the specification of the
possible outcomes).

4 Note that, in this book, the symbol \(\pi\) is not used to denote the irrational number used to
compute the circumference and area of a circle, but rather as the Greek letter for ‘p’ which
stands for ‘payoff’.

For example, take the game-frame illustrated in Table 1.1, let Sarah be Player 1 and
Steven Player 2 and name the possible outcomes as shown in Table 1.2. Let us add
the information that both players are selfish and greedy (that is, Player 1’s ranking
is \(o_3 \succ_1 o_1 \succ_1 o_2 \sim_1 o_4\) and Player 2’s ranking is \(o_2 \succ_2 o_1 \succ_2 o_3 \sim_2 o_4\)) and let us
represent their rankings with the following utility functions (note, again, that the
choice of numbers 2, 3 and 4 for utilities is arbitrary: any other three numbers
in the same order would do):

\[
\begin{array}{l|cccc}
\text{utility function} \downarrow \quad \text{outcome} \rightarrow & o_1 & o_2 & o_3 & o_4 \\ \hline
U_1 \text{ (Player 1)} & 3 & 2 & 4 & 2 \\
U_2 \text{ (Player 2)} & 3 & 4 & 2 & 2
\end{array}
\]

Then we obtain the following reduced-form game, where in each cell the first
number is the payoff of Player 1 and the second number is the payoff of Player 2.

                        Player 2 (Steven)
                        Split      Steal

Player 1     Split      3 , 3      2 , 4
(Sarah)      Steal      4 , 2      2 , 2

Table 1.4

On the other hand, if we add to the game-frame of Table 1.1 the information that
Player 1 is fair-minded and benevolent (that is, her ranking is \(o_1 \succ_1 o_3 \succ_1 o_2 \succ_1 o_4\)),
while Player 2 is selfish and greedy and represent these rankings with the following
utility functions,


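\[
\begin{array}{l|cccc}
\text{utility function} \downarrow \quad \text{outcome} \rightarrow & o_1 & o_2 & o_3 & o_4 \\ \hline
U_1 \text{ (Player 1)} & 4 & 2 & 3 & 1 \\
U_2 \text{ (Player 2)} & 3 & 4 & 2 & 2
\end{array}
\]

then we obtain the following reduced-form game:

                        Player 2 (Steven)
                        Split      Steal

Player 1     Split      4 , 3      2 , 4
(Sarah)      Steal      3 , 2      1 , 2

Table 1.5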
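
The passage from a game-frame plus utility functions to a reduced-form game is just the composition of each utility function with the outcome function f. A minimal sketch of that computation for the two games above follows; the dictionary encoding and the name `reduced_form` are assumptions made for illustration.

```python
# Outcome function f of the Golden Balls game-frame (Table 1.1):
# the first entry of each profile is Sarah's strategy, the second is Steven's.
f = {
    ("Split", "Split"): "o1",
    ("Split", "Steal"): "o2",
    ("Steal", "Split"): "o3",
    ("Steal", "Steal"): "o4",
}

# Utility functions taken from the two examples in the text.
U1_selfish = {"o1": 3, "o2": 2, "o3": 4, "o4": 2}  # Player 1, selfish and greedy
U1_fair = {"o1": 4, "o2": 2, "o3": 3, "o4": 1}     # Player 1, fair-minded and benevolent
U2_selfish = {"o1": 3, "o2": 4, "o3": 2, "o4": 2}  # Player 2, selfish and greedy

def reduced_form(f, utilities):
    """Map each strategy profile s to the payoffs (U_1(f(s)), ..., U_n(f(s)))."""
    return {s: tuple(U[o] for U in utilities) for s, o in f.items()}

table_1_4 = reduced_form(f, [U1_selfish, U2_selfish])
table_1_5 = reduced_form(f, [U1_fair, U2_selfish])

for s in f:
    print(s, table_1_4[s], table_1_5[s])
```

The same game-frame thus produces two different games, depending only on which utility function is plugged in for Player 1.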

In general, a player will act differently in different games, even if they are based on
the  same  game-frame,  because  her  incentives  and  objectives  (as  captured  by  her 
ranking  of  the  outcomes)  will  be  different.  For  example,  one  can  argue  that  in  the 
game  of  Table  1.4  a rational  Player  1 would  choose  Steal,  while  in  the  game  of 
Table  1.5  the  rational  choice  for  Player  1 is  Split. 

This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this section, by going through the exercises in Section 1.E.1 of Appendix 1.E at the
end  of  this  chapter. 


1.2.  Strict  and  weak  dominance 

In this section we define two relations on the set of strategies of a player. Before
introducing the formal definitions, we shall illustrate these notions with an example.
The  first  relation  is  called  “strict  dominance”.  Let  us  focus  our  attention  on  one 
player,  say  Player  1,  and  select  two  of  her  strategies,  say  a and  b.  We  say  that  a 
strictly  dominates  b (for  Player  1)  if,  for  every  possible  strategy  profile  of  the  other 
players,  strategy  a of  Player  1,  in  conjunction  with  the  strategies  selected  by  the 
other  players,  yields  a payoff  for  Player  1 which  is  greater  than  the  payoff 
associated  with  strategy  b (in  conjunction  with  the  strategies  selected  by  the  other 
players).  For  example,  consider  the  following  two-player  game,  where  only  the 
payoffs  of  Player  1 are  shown: 


                          Player 2
                     E        F        G

              A    3 ...    2 ...    1 ...
Player 1      B    2 ...    1 ...    0 ...
              C    3 ...    2 ...    1 ...
              D    2 ...    0 ...    0 ...

Table 1.6


In  this  game  for  Player  1 strategy  A strictly  dominates  strategy  B: 

• if  Player  2 selects  E then  A in  conjunction  with  E gives  Player  1 a payoff  of 
3,  while  B in  conjunction  with  E gives  her  only  a payoff  of  2, 

• if Player 2 selects F then A in conjunction with F gives Player 1 a payoff of
2, while B in conjunction with F gives her only a payoff of 1,

• if Player 2 selects G then A in conjunction with G gives Player 1 a payoff of
1, while B in conjunction with G gives her only a payoff of 0.

In  the  game  of  Table  1.6  we  also  have  that  A strictly  dominates  D and  C strictly 
dominates  D;  however,  it  is  not  the  case  that  B strictly  dominates  D because,  in 
conjunction  with  strategy  E of  Player  2,  B and  D yield  the  same  payoff  for  Player  1. 

The  second  relation  is  called  “weak  dominance”.  The  definition  is  similar  to  that  of 
strict  dominance,  but  we  replace  ‘greater  than’  with  ‘greater  than  or  equal  to’ 
while  insisting  on  at  least  one  strict  inequality:  a weakly  dominates  b (for  Player  1) 
if,  for  every  possible  strategy  profile  of  the  other  players,  strategy  a of  Player  1,  in 
conjunction  with  the  strategies  selected  by  the  other  players,  yields  a payoff  for 
Player  1 which  is  greater  than  or  equal  to  the  payoff  associated  with  strategy  b (in 
conjunction  with  the  strategies  selected  by  the  other  players)  and,  furthermore, 
there  is  at  least  one  strategy  profile  of  the  other  players  against  which  strategy  a 
gives  a larger  payoff  to  Player  1 than  strategy  b.  In  the  example  of  Table  1.6,  we 
have  that,  while  it  is  not  true  that  B strictly  dominates  D,  it  is  true  that  B weakly 
dominates D:

• if  Player  2 selects  E,  then  B in  conjunction  with  E gives  Player  1 the  same 
payoff  as  D in  conjunction  with  E (namely  2), 

• if Player 2 selects F, then B in conjunction with F gives Player 1 a payoff of
1, while D in conjunction with F gives her only a payoff of 0,

• if  Player  2 selects  G then  B in  conjunction  with  G gives  Player  1 the  same 
payoff  as  D in  conjunction  with  G (namely  0). 
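
The comparisons made above for Table 1.6 can be automated. The sketch below stores Player 1's payoffs row by row and tests the two relations exactly as described in words; the nested-dictionary layout and the function names are illustrative assumptions.

```python
# Player 1's payoffs in Table 1.6: rows are her strategies,
# columns are Player 2's strategies E, F and G.
payoff_1 = {
    "A": {"E": 3, "F": 2, "G": 1},
    "B": {"E": 2, "F": 1, "G": 0},
    "C": {"E": 3, "F": 2, "G": 1},
    "D": {"E": 2, "F": 0, "G": 0},
}

def strictly_dominates(a, b, payoff):
    """a gives a greater payoff than b against every choice of the opponent."""
    return all(payoff[a][t] > payoff[b][t] for t in payoff[a])

def weakly_dominates(a, b, payoff):
    """a is never worse than b and is better against at least one choice."""
    never_worse = all(payoff[a][t] >= payoff[b][t] for t in payoff[a])
    sometimes_better = any(payoff[a][t] > payoff[b][t] for t in payoff[a])
    return never_worse and sometimes_better

for a in payoff_1:
    for b in payoff_1:
        if a != b and weakly_dominates(a, b, payoff_1):
            kind = "strictly" if strictly_dominates(a, b, payoff_1) else "weakly"
            print(f"{a} {kind} dominates {b}")
```

With the definitions as stated in the text, strict dominance implies weak dominance, so the loop reports the strongest relation that holds for each pair. Strategies A and C give identical payoffs, so neither dominates the other.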

In order to give the definitions in full generality we need to introduce some
notation. Recall that S denotes the set of strategy profiles, that is, an element s of S is
an ordered list of strategies \(s = (s_1, \dots, s_n)\), one for each player. We will often want
to focus on one player, say Player i, and view s as a pair consisting of the strategy of
Player i and the remaining strategies of all the other players. For example, suppose
that there are three players and the strategy sets are as follows: \(S_1 = \{a, b, c\}\),
\(S_2 = \{d, e\}\) and \(S_3 = \{f, g\}\). Then one possible strategy profile is \(s = (b, d, g)\) (thus
\(s_1 = b\), \(s_2 = d\) and \(s_3 = g\)). If we focus on, say, Player 2 then we will denote by
\(s_{-2}\) the sub-profile consisting of the strategies of the players other than 2: in this case
\(s_{-2} = (b, g)\). This gives us an alternative way of denoting s, namely as \((s_2, s_{-2})\).
Continuing our example where \(s = (b, d, g)\), letting \(s_{-2} = (b, g)\), we can denote s
also by \((d, s_{-2})\) and we can write the result of replacing Player 2’s strategy d with
her strategy e in s by \((e, s_{-2})\); thus \((d, s_{-2}) = (b, d, g)\) while \((e, s_{-2}) = (b, e, g)\). In
general, given a Player i, we denote by \(S_{-i}\) the set of strategy profiles of the players
other than i (that is, \(S_{-i}\) is the Cartesian product of the strategy sets of the other
players; in the above example we have that
\(S_{-2} = S_1 \times S_3 = \{a, b, c\} \times \{f, g\} = \{(a, f), (a, g), (b, f), (b, g), (c, f), (c, g)\}\)). We
denote an element of \(S_{-i}\) by \(s_{-i}\).

Definition 1.4. Given an ordinal game in strategic form, let i be a Player and a and
b two of her strategies (\(a, b \in S_i\)). We say that, for Player i,

• a strictly dominates b (or b is strictly dominated by a) if, in every situation (that is,
no matter what the other players do), a gives Player i a payoff which is greater
than the payoff that b gives. Formally: for every \(s_{-i} \in S_{-i}\), \(\pi_i(a, s_{-i}) > \pi_i(b, s_{-i})\). [5]

• a weakly dominates b (or b is weakly dominated by a) if, in every situation, a gives
Player i a payoff which is greater than or equal to the payoff that b gives and,
furthermore, there is at least one situation where a gives a greater payoff than b.
Formally: for every \(s_{-i} \in S_{-i}\), \(\pi_i(a, s_{-i}) \ge \pi_i(b, s_{-i})\) and there exists an
\(\bar{s}_{-i} \in S_{-i}\) such that \(\pi_i(a, \bar{s}_{-i}) > \pi_i(b, \bar{s}_{-i})\). [6]

• a is equivalent to b if, in every situation, a and b give Player i the same payoff.
Formally: for every \(s_{-i} \in S_{-i}\), \(\pi_i(a, s_{-i}) = \pi_i(b, s_{-i})\). [7]


[5] Or, stated in terms of rankings instead of payoffs, \(f(a, s_{-i}) \succ_i f(b, s_{-i})\) for every \(s_{-i} \in S_{-i}\).

[6] Or, stated in terms of rankings, \(f(a, s_{-i}) \succsim_i f(b, s_{-i})\) for every \(s_{-i} \in S_{-i}\), and there exists an
\(\bar{s}_{-i} \in S_{-i}\) such that \(f(a, \bar{s}_{-i}) \succ_i f(b, \bar{s}_{-i})\).

[7] Or, stated in terms of rankings, \(f(a, s_{-i}) \sim_i f(b, s_{-i})\) for every \(s_{-i} \in S_{-i}\).


                    Player 2
                 E     F     G
            A    3     2     1
Player 1    B    2     1     0
            C    3     2     1
            D    2     0     0

For example, in the game of Table 1.6 reproduced above, we have that

• A strictly dominates B.

• A and C are equivalent.

• A strictly dominates D.

• B is strictly dominated by C.

• B weakly (but not strictly) dominates D.

• C strictly dominates D.
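
The comparisons listed above can be checked mechanically. The following is a minimal Python sketch, not part of the original text: the dictionary PAYOFF_1 and the function relation are illustrative names, and the numbers are Player 1's payoffs as they appear in Table 1.6.

```python
# Player 1's payoffs in Table 1.6: outer keys are Player 1's strategies,
# inner keys are Player 2's strategies E, F, G.
PAYOFF_1 = {
    "A": {"E": 3, "F": 2, "G": 1},
    "B": {"E": 2, "F": 1, "G": 0},
    "C": {"E": 3, "F": 2, "G": 1},
    "D": {"E": 2, "F": 0, "G": 0},
}


def relation(payoffs, a, b):
    """Classify how strategy a compares with strategy b (Definition 1.4)."""
    pairs = [(payoffs[a][col], payoffs[b][col]) for col in payoffs[a]]
    if all(x > y for x, y in pairs):
        return "strictly dominates"
    if all(x == y for x, y in pairs):
        return "equivalent"
    if all(x >= y for x, y in pairs):
        # At least one comparison is strict here, because the
        # all-equal case was handled just above.
        return "weakly dominates"
    return "none"


if __name__ == "__main__":
    for a, b in [("A", "B"), ("A", "C"), ("B", "D"), ("C", "D")]:
        print(a, relation(PAYOFF_1, a, b), b)
```

Only the order of the payoffs matters in the comparisons, which is all that an ordinal game provides.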

Remark 1.3. Note that if strategy a strictly dominates strategy b then it also
satisfies the conditions for weak dominance, that is, ‘a strictly dominates b’ implies
‘a weakly dominates b’. Throughout the book the expression ‘a weakly dominates
b’ will be interpreted as ‘a dominates b weakly but not strictly’.

The expression ‘a dominates b’ can be understood as ‘a is better than b’. The next
term we define is ‘dominant’, which can be understood as ‘best’. Thus one cannot
meaningfully  say  “a  dominates”  because  one  needs  to  name  another  strategy  that  is 
dominated  by  a;  for  example,  one  would  have  to  say  “a  dominates  b”.  On  the  other 
hand,  one  can  meaningfully  say  “a  is  dominant”  because  it  is  like  saying  “a  is  best”, 
which  means  “a  is  better  than  every  other  strategy”. 

Definition 1.5. Given an ordinal game in strategic form, let i be a Player and a one
of her strategies (\(a \in S_i\)). We say that, for Player i,

• a is a strictly dominant strategy if a strictly dominates every other strategy of
Player i.

• a is a weakly dominant strategy if, for every other strategy x of Player i, one of the
following is true: (1) either a weakly dominates x or (2) a is equivalent to x.

For  example,  in  the  game  shown  in  Table  1.6,  A and  C are  both  weakly  dominant 
strategies  for  Player  1.  Note  that  if  a player  has  two  or  more  strategies  that  are 
weakly  dominant,  then  any  two  of  those  strategies  must  be  equivalent.  On  the 
other  hand,  there  can  be  at  most  one  strictly  dominant  strategy. 
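
As an illustration, the sketch below lists the dominant strategies of Player 1 in Table 1.6. It is a hedged example rather than the book's own procedure: the names PAYOFF_1, weakly_dominant and strictly_dominant are assumptions, and weak dominance is taken here in the broad sense (at least as good as every alternative against every choice of the opponent), so a strictly dominant strategy would also be listed.

```python
# Player 1's payoffs in Table 1.6 (outer key: own strategy, inner key: Player 2's strategy).
PAYOFF_1 = {
    "A": {"E": 3, "F": 2, "G": 1},
    "B": {"E": 2, "F": 1, "G": 0},
    "C": {"E": 3, "F": 2, "G": 1},
    "D": {"E": 2, "F": 0, "G": 0},
}


def weakly_dominant(payoffs):
    """Strategies that do at least as well as every alternative in every column."""
    return [a for a in payoffs
            if all(payoffs[a][col] >= payoffs[x][col]
                   for x in payoffs for col in payoffs[a])]


def strictly_dominant(payoffs):
    """Strategies that do strictly better than every other strategy in every column."""
    return [a for a in payoffs
            if all(payoffs[a][col] > payoffs[x][col]
                   for x in payoffs if x != a for col in payoffs[a])]


if __name__ == "__main__":
    print(weakly_dominant(PAYOFF_1))    # the two equivalent strategies A and C
    print(strictly_dominant(PAYOFF_1))  # empty: A and C tie with each other
```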

Remark 1.4. The reader should convince herself/himself that the definition of
weakly dominant strategy given in Definition 1.5 is equivalent to the following:
\(a \in S_i\) is a weakly dominant strategy for Player i if and only if, for every \(s_{-i} \in S_{-i}\),
\(\pi_i(a, s_{-i}) \ge \pi_i(s_i, s_{-i})\) for every \(s_i \in S_i\). [8]

In accordance with the convention established in Remark 1.3, the expression ‘a is a
weakly dominant strategy’ will have the default interpretation ‘a is a weakly but not
strictly dominant strategy’.

Note: if you claim that, for some player, “strategy x is (weakly or strictly)
dominated” then you ought to name another strategy of that player that dominates
x. Saying “x is dominated” is akin to saying “x is worse”: worse than what? On the
other hand, claiming that strategy y is weakly dominant is akin to claiming that it is
best, that is, better than, or just as good as, any other strategy.

[8] Or, stated in terms of rankings, for every \(s_{-i} \in S_{-i}\), \(f(a, s_{-i}) \succsim_i f(s_i, s_{-i})\) for every \(s_i \in S_i\).


Definition 1.6. Given an ordinal game in strategic form, let \(s = (s_1, \dots, s_n)\) be a
strategy profile. We say that

• s is a strict dominant-strategy equilibrium if, for every Player i, \(s_i\) is a strictly
dominant strategy.

• s is a weak dominant-strategy equilibrium if, for every Player i, \(s_i\) is a weakly
dominant strategy and, furthermore, for at least one Player j, \(s_j\) is not a strictly
dominant strategy.

If  we  refer  to  a strategy  profile  as  a dominant-strategy  equilibrium,  without 
qualifying  it  as  weak  or  strict,  then  the  default  interpretation  will  be  ‘weak’. 

In the game of Table 1.4, reproduced below, Steal is a weakly dominant strategy for
each player and thus (Steal, Steal) is a weak dominant-strategy equilibrium.

[Table 1.4: Player 1 (Sarah) chooses a row (Split or Steal) and Player 2 (Steven)
chooses a column (Split or Steal); the payoff entries are not reproduced here.]


In the game of Table 1.5, reproduced below, Split is a strictly dominant strategy for
Player 1, while Steal is a weakly (but not strictly) dominant strategy for Player 2
and thus (Split, Steal) is a weak dominant-strategy equilibrium.

                        Player 2 (Steven)
                        Split       Steal
Player 1     Split      4, 3        2, 4
(Sarah)      Steal      3, 2        1, 2
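
The classification of (Split, Steal) can be reproduced with a short Python sketch. It is illustrative only: the names GAME, payoff, dominant and classify are assumptions, the payoff pairs are read off Table 1.5 as (Player 1, Player 2), and both players are assumed to have the same two strategies.

```python
# Table 1.5: GAME[(row, column)] = (payoff of Player 1, payoff of Player 2).
GAME = {
    ("Split", "Split"): (4, 3), ("Split", "Steal"): (2, 4),
    ("Steal", "Split"): (3, 2), ("Steal", "Steal"): (1, 2),
}
STRATEGIES = ("Split", "Steal")  # the same strategy set for both players


def payoff(player, own, other):
    """Payoff of `player` (0 = Sarah, 1 = Steven) from `own` against `other`."""
    profile = (own, other) if player == 0 else (other, own)
    return GAME[profile][player]


def dominant(player, strict):
    """Strictly dominant strategies if strict, else weakly dominant in the broad sense."""
    def at_least(x, y):
        return x > y if strict else x >= y
    return [a for a in STRATEGIES
            if all(at_least(payoff(player, a, o), payoff(player, x, o))
                   for x in STRATEGIES if x != a for o in STRATEGIES)]


def classify(profile):
    """Return 'strict', 'weak' or 'none' for a profile, following Definition 1.6."""
    if all(profile[p] in dominant(p, strict=True) for p in (0, 1)):
        return "strict"
    if all(profile[p] in dominant(p, strict=False) for p in (0, 1)):
        return "weak"
    return "none"


if __name__ == "__main__":
    print(classify(("Split", "Steal")))
```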

The Prisoner’s Dilemma is an example of a game with a strict dominant-strategy
equilibrium. For a detailed account of the history of this game and an in-depth
analysis of it see

http://plato.stanford.edu/entries/prisoner-dilemma or
http://en.wikipedia.org/wiki/Prisoner's_dilemma.
An instance of the Prisoner’s Dilemma is the following. Doug and Ed work for the
same  company  and  the  annual  party  is  approaching.  They  know  that  they  are  the 
only  two  candidates  for  the  best-worker-of-the-year  prize  and  at  the  moment  they 
are  tied;  however,  only  one  person  can  be  awarded  the  prize  and  thus,  unless  one  of 
them  manages  to  outperform  the  other,  nobody  will  receive  the  prize.  Each  chooses 
between  exerting  Normal  effort  or  Extra  effort  (that  is,  work  overtime)  before  the 
party.  The  game-frame  is  as  shown  in  Table  1.7  below. 


                                    Player 2 (Ed)
                             Normal effort    Extra effort
Player 1    Normal effort        o1               o2
(Doug)      Extra effort         o3               o4

o1: nobody gets the prize and nobody sacrifices family time
o2: Ed gets the prize and sacrifices family time, Doug does not
o3: Doug gets the prize and sacrifices family time, Ed does not
o4: nobody gets the prize and both sacrifice family time

Table 1.7


Suppose that both Doug and Ed are willing to sacrifice family time to get the prize,
but otherwise value family time; furthermore, they are envious of each other, in the
sense that they prefer nobody getting the prize to the other person’s getting the
prize (even at the personal cost of sacrificing family time). That is, their rankings are
as follows: \(o_3 \succ_{Doug} o_1 \succ_{Doug} o_4 \succ_{Doug} o_2\) and \(o_2 \succ_{Ed} o_1 \succ_{Ed} o_4 \succ_{Ed} o_3\). Using utility
functions with values from the set {0, 1, 2, 3} we can represent the game in reduced
form as follows:


                                    Player 2 (Ed)
                             Normal effort    Extra effort
Player 1    Normal effort        2, 2             0, 3
(Doug)      Extra effort         3, 0             1, 1

Table 1.8
The Prisoner’s Dilemma game
In this game exerting extra effort is a strictly dominant strategy for every player;
thus  ( Extra  effort,  Extra  effort)  is  a strict  dominant-strategy  equilibrium. 

Definition 1.7. Given an ordinal game in strategic form, let o and o' be two
outcomes. We say that o is strictly Pareto superior to o' if every player prefers o
to o' (that is, if \(o \succ_i o'\) for every Player i). We say that o is weakly Pareto superior
to o' if every player considers o to be at least as good as o' and at least one player
prefers o to o' (that is, if \(o \succsim_i o'\) for every Player i and there is a Player j such that
\(o \succ_j o'\)). In reduced-form games, this definition can be extended to strategy profiles
as follows. If s and s' are two strategy profiles, then s is strictly Pareto superior to
s' if \(\pi_i(s) > \pi_i(s')\) for every Player i and s is weakly Pareto superior to s' if
\(\pi_i(s) \ge \pi_i(s')\) for every Player i and, furthermore, there is a Player j such that
\(\pi_j(s) > \pi_j(s')\).

For example, in the Prisoner’s Dilemma game of Table 1.8, outcome \(o_1\) is strictly
Pareto superior to \(o_4\) or, in terms of strategy profiles, (Normal effort, Normal effort)
is strictly Pareto superior to (Extra effort, Extra effort).
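
The Pareto comparison of strategy profiles in Definition 1.7 amounts to a componentwise comparison of payoff vectors. The sketch below is illustrative: PD and pareto_relation are assumed names, and the payoffs are those of Table 1.8, written as (Doug, Ed).

```python
# Table 1.8: PD[(Doug's strategy, Ed's strategy)] = (Doug's payoff, Ed's payoff).
PD = {
    ("Normal", "Normal"): (2, 2), ("Normal", "Extra"): (0, 3),
    ("Extra", "Normal"): (3, 0), ("Extra", "Extra"): (1, 1),
}


def pareto_relation(game, s, t):
    """Return 'strictly' or 'weakly' if profile s is Pareto superior to t, else None.

    'weakly' is returned only when s is weakly but not strictly Pareto superior.
    """
    pairs = list(zip(game[s], game[t]))
    if all(x > y for x, y in pairs):
        return "strictly"
    if all(x >= y for x, y in pairs) and any(x > y for x, y in pairs):
        return "weakly"
    return None


if __name__ == "__main__":
    print(pareto_relation(PD, ("Normal", "Normal"), ("Extra", "Extra")))
```

Note that neither (Extra, Normal) nor (Normal, Extra) is Pareto superior to (Normal, Normal): in each case one player gains and the other loses.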

When  a player  has  a strictly  dominant  strategy,  it  would  be  irrational  for  that 
player  to  choose  any  other  strategy,  since  she  would  be  guaranteed  a lower  payoff 
in  every  possible  situation  (that  is,  no  matter  what  the  other  players  do).  Thus  in 
the  Prisoner’s  Dilemma  individual  rationality  leads  to  ( Extra  effort,  Extra  effort) 
despite  the  fact  that  both  players  would  be  better  off  if  they  both  chose  Normal 
effort.  It  is  obvious  that  if  the  players  could  reach  a binding  agreement  to  exert 
normal  effort  then  they  would  do  so;  however,  the  underlying  assumption  in  non- 
cooperative  game  theory  is  that  such  agreements  are  not  possible  (e.g.  because  of 
lack  of  communication  or  because  such  agreements  are  illegal  or  cannot  be  enforced 
in  a court  of  law,  etc.).  Any  non-binding  agreement  to  choose  Normal  effort  would 
not  be  viable:  if  one  player  expects  the  other  player  to  stick  to  the  agreement,  then 
he will gain by cheating and choosing Extra effort. On the other hand, if a player
does not believe that the other player will honor the agreement then he will gain by
deviating from the agreement himself. The Prisoner’s Dilemma game is often used
to  illustrate  a conflict  between  individual  rationality  and  collective  rationality:  (Extra 
effort,  Extra  effort)  is  the  individually  rational  outcome  while  ( Normal  effort,  Normal 
effort)  would  be  the  collectively  rational  one. 


This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this section, by going through the exercises in Section 1.E.2 of Appendix 1.E at the
end  of  this  chapter. 
1.3. Second-price auction

The  second-price  auction,  or  Vickrey  auction,  is  an  example  of  a game  that  has  a 
weak  dominant-strategy  equilibrium.  It  is  a “sealed-bid”  auction  where  bidders 
submit  bids  without  knowing  the  bids  of  the  other  participants  in  the  auction.  The 
object  which  is  auctioned  is  then  assigned  to  the  bidder  who  submits  the  highest  bid 
(the  winner),  but  the  winner  pays  not  her  own  bid  but  rather  the  second-highest 
bid,  that  is  the  highest  bid  among  the  bids  that  remain  after  removing  the  winner’s 
own  bid.  Tie-breaking  rules  must  be  specified  for  selecting  the  winner  when  the 
highest  bid  is  submitted  by  two  or  more  bidders  (in  which  case  the  winner  ends  up 
paying  her  own  bid,  because  the  second-highest  bid  is  equal  to  the  winner’s  bid). 
We  first  illustrate  this  auction  with  an  example. 

Two oil companies bid for the right to drill a field. The possible bids are $10
million, $20 million, ..., $50 million. In case of ties the winner is Player 2 (this was
decided earlier by tossing a coin). Let us take the point of view of Player 1. Suppose
that Player 1 ordered a geological survey and, based on the report, concludes that
the oil field would generate a profit of $30 million. Suppose also that Player 1 is
indifferent between any two outcomes where the oil field is given to Player 2 and
prefers to get the oil field itself if and only if it has to pay less than $30
million for it; furthermore, getting the oil field for $30 million is just as good as not
getting it. Then we can take as utility function for Player 1 the net gain to Player 1
from the oil field (defined as profits from oil extraction minus the price paid for
access to the oil field) if Player 1 wins, and zero otherwise. In Table 1.9 we have
written inside each cell only the payoff of Player 1. For example, why is Player 1’s
payoff 20 when it bids $30M and Player 2 bids $10M? Since Player 1’s bid is higher
than Player 2’s bid, Player 1 is the winner and thus the drilling rights are assigned to
Player 1; hence Player 1 obtains something worth $30M and pays, not its own bid
of $30M, but the bid of Player 2, namely $10M; thus Player 1’s net gain is
$(30 - 10)M = $20M. It can be verified that for Player 1 submitting a bid equal to the
value it assigns to the object (namely, a bid of $30 million) is a weakly dominant
strategy: it always gives Player 1 the largest of the payoffs that are possible, given
the bid of the other player. This does not imply that it is the only weakly dominant
strategy; indeed, in this example bidding $40M is also a weakly dominant strategy
for Player 1 (in fact, it is equivalent to bidding $30M).
                                      Player 2
                        $10M    $20M    $30M    $40M    $50M
                $10M      0       0       0       0       0
Player 1        $20M     20       0       0       0       0
(value $30M)    $30M     20      10       0       0       0
                $40M     20      10       0       0       0
                $50M     20      10       0     -10       0

Table 1.9
A second-price auction where, in case of ties, the winner is Player 2.
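
The entries of Table 1.9, and the claim that bidding $30M or $40M is weakly dominant for Player 1, can be verified with a few lines of Python. This is a minimal sketch with assumed names (VALUE, BIDS, payoff_1, is_weakly_dominant); amounts are in millions of dollars and weak dominance is checked in the broad sense of Remark 1.4.

```python
VALUE = 30                   # Player 1's valuation, in $M
BIDS = [10, 20, 30, 40, 50]  # bids available to each player, in $M


def payoff_1(b1, b2):
    """Player 1's payoff: ties go to Player 2, and the winner pays the other bid."""
    return VALUE - b2 if b1 > b2 else 0


# TABLE[b1] is the row of Table 1.9 for Player 1's bid b1.
TABLE = {b1: [payoff_1(b1, b2) for b2 in BIDS] for b1 in BIDS}


def is_weakly_dominant(b):
    """True if bid b is at least as good as every other bid against every bid of Player 2."""
    return all(payoff_1(b, b2) >= payoff_1(x, b2) for x in BIDS for b2 in BIDS)


if __name__ == "__main__":
    for b1 in BIDS:
        print(b1, TABLE[b1])
    print([b for b in BIDS if is_weakly_dominant(b)])
```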


Now we can describe the second-price auction in more general terms. Let n ≥ 2 be
the number of bidders. We assume that all non-negative bids are allowed and that
the tie-breaking rule favors the player with the lowest index among those who
submit the highest bid: for example, if the highest bid is $250 and it is submitted by
Players 5, 8 and 10, then the winner is Player 5. We shall denote the possible
outcomes as pairs (i, p), where i is the winner and p is the price that the winner has
to pay. Finally we denote by \(b_i\) the bid of Player i. We start by describing the case
where there are only two bidders and then generalize to the case of an arbitrary
number of bidders. We denote the set of non-negative numbers by \([0, \infty)\).

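The case where n = 2: in this case we have that \(I = \{1, 2\}\), \(S_1 = S_2 = [0, \infty)\),
\(O = \{(i, p) : i \in \{1, 2\},\ p \in [0, \infty)\}\) and \(f : S \to O\) is given by

\[ f(b_1, b_2) = \begin{cases} (1, b_2) & \text{if } b_1 \ge b_2 \\ (2, b_1) & \text{if } b_1 < b_2 \end{cases} \]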

The case where n ≥ 2: in the general case the second-price auction is the following
game-frame:

• \(I = \{1, \dots, n\}\)

• \(S_i = [0, \infty)\) for every \(i = 1, \dots, n\). We denote an element of \(S_i\) by \(b_i\).

• \(O = \{(i, p) : i \in I,\ p \in [0, \infty)\}\)

• \(f : S \to O\) is defined as follows. Let \(H(b_1, \dots, b_n)\) be the set of bidders who
submit the highest bid: \(H(b_1, \dots, b_n) = \{i \in I : b_i \ge b_j \text{ for all } j \in I\}\) and let
\(\hat{i}(b_1, \dots, b_n)\) be the smallest number in the set \(H(b_1, \dots, b_n)\), that is, the winner
of the auction. Finally, let \(b^{max}\) denote the maximum bid (that is,
\(b^{max}(b_1, \dots, b_n) = \max\{b_1, \dots, b_n\}\)), and let \(b^{second}(b_1, \dots, b_n)\) be the second-highest
bid (that is, \(b^{second}(b_1, \dots, b_n) = \max(\{b_1, \dots, b_n\} \setminus \{b^{max}(b_1, \dots, b_n)\})\),
where only the winner's own bid is removed, so that a tie for the highest bid
gives \(b^{second} = b^{max}\)). [9]
Then \(f(b_1, \dots, b_n) = (\hat{i}(b_1, \dots, b_n),\ b^{second}(b_1, \dots, b_n))\).
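
The outcome function f of the general second-price auction can be sketched in Python as follows. The function name second_price_outcome is an assumption made for illustration; bids are passed as a list whose first entry is Player 1's bid, and the second-highest bid is computed as the highest bid that remains after removing the winner's own bid, as described in the text.

```python
def second_price_outcome(bids):
    """Return (winner, price) for a list of bids, with Player i's bid at position i - 1.

    The winner is the lowest-indexed player among those who submit the
    highest bid; the price is the highest bid among the remaining bids.
    Requires at least two bidders.
    """
    if len(bids) < 2:
        raise ValueError("a second-price auction needs at least two bidders")
    b_max = max(bids)
    # H: the players (numbered from 1) who submit the highest bid.
    H = [i + 1 for i, b in enumerate(bids) if b == b_max]
    winner = min(H)
    remaining = bids[:winner - 1] + bids[winner:]
    return winner, max(remaining)


if __name__ == "__main__":
    # Bids 10, 14, 8, 14, 14: Players 2, 4 and 5 tie, so Player 2 wins and pays 14.
    print(second_price_outcome([10, 14, 8, 14, 14]))
```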

How much should a player bid in a second-price auction? Since what we have
described is a game-frame and not a game, we cannot answer the question unless we
specify the player’s preferences over the set of outcomes O. Let us say that Player i
in a second-price auction is selfish and greedy if she only cares about whether or not
she wins and, conditional on winning, prefers to pay less; furthermore, she prefers
winning to not winning if and only if she has to pay less than the true value of the
object for her, which we denote by \(v_i\), and is indifferent between not winning and
winning if she has to pay \(v_i\). Thus the ranking of a selfish and greedy player is as
follows (together with everything that follows from transitivity):

\((i, p) \succ_i (i, p')\) if and only if \(p < p'\)

\((i, p) \succ_i (j, p')\) for all \(j \ne i\) and for all \(p'\), if and only if \(p < v_i\)

\((i, v_i) \sim_i (j, p')\) for all \(j \ne i\) and for all \(p'\)

\((j, p) \sim_i (k, p')\) for all \(j \ne i\), \(k \ne i\) and for all \(p\) and \(p'\)


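An ordinal utility function that represents those preferences is [10]

\[ U_i(j, p) = \begin{cases} v_i - p & \text{if } i = j \\ 0 & \text{if } i \ne j \end{cases} \]

Using this utility function we get the following payoff function for Player i:

\[ \pi_i(b_1, \dots, b_n) = \begin{cases} v_i - b^{second}(b_1, \dots, b_n) & \text{if } i = \hat{i}(b_1, \dots, b_n) \\ 0 & \text{if } i \ne \hat{i}(b_1, \dots, b_n) \end{cases} \]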


We can now state the following theorem. The proof is given in Appendix 1.A at
the end of this chapter.


[9] For example, if n = 5, \(b_1 = 10\), \(b_2 = 14\), \(b_3 = 8\), \(b_4 = 14\) and \(b_5 = 14\) then
\(H(10, 14, 8, 14, 14) = \{2, 4, 5\}\), \(\hat{i}(10, 14, 8, 14, 14) = 2\) and
\(b^{max}(10, 14, 8, 14, 14) = b^{second}(10, 14, 8, 14, 14) = 14\).

[10] Of course there are many more. For example, also the following utility function represents
those preferences: \(U_i(j, p) = [\dots]\) if \(i = j\) and \([\dots]\) if \(i \ne j\).
Theorem 1.1 [Vickrey, 1961]. In a second-price auction, if Player i is selfish and
greedy then it is a weakly dominant strategy for Player i to bid her true value, that
is, to choose \(b_i = v_i\).

Note  that,  for  a player  who  is  not  selfish  and  greedy,  Theorem  1.1  is  not  true.  For 
example,  if  a player  has  the  same  preferences  as  above  for  the  case  where  she  wins, 
but,  conditional  on  not  winning,  prefers  the  other  player  to  pay  as  much  as  possible 
(she  is  spiteful)  or  as  little  as  possible  (she  is  generous),  then  bidding  her  true  value  is 
no  longer  a dominant  strategy. 
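
Theorem 1.1 can be spot-checked numerically. The sketch below is not a proof: it only examines bids on a small integer grid, for a selfish and greedy player with the payoff function given above, and all names (winner_and_price, payoff, truthful_is_weakly_dominant) are illustrative assumptions. Players are numbered from 0 in the code.

```python
from itertools import product


def winner_and_price(bids):
    """Winner (0-based, lowest index among the highest bidders) and price paid."""
    winner = bids.index(max(bids))
    price = max(bids[:winner] + bids[winner + 1:])
    return winner, price


def payoff(i, value, bids):
    """Payoff of a selfish and greedy Player i: value minus price if she wins, else 0."""
    winner, price = winner_and_price(bids)
    return value - price if winner == i else 0


def truthful_is_weakly_dominant(i, value, n, grid):
    """Check on the grid that bidding `value` is never worse than any other bid."""
    for others in product(grid, repeat=n - 1):
        def play(b):
            bids = list(others[:i]) + [b] + list(others[i:])
            return payoff(i, value, bids)
        if any(play(value) < play(b) for b in grid):
            return False
    return True


if __name__ == "__main__":
    # Three bidders, bids restricted to 0..6, true value 3 for the player checked.
    print(all(truthful_is_weakly_dominant(i, 3, 3, range(7)) for i in range(3)))
```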

This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this section, by going through the exercises in Section 1.E.3 of Appendix 1.E at the
end  of  this  chapter. 


1.4.  The  pivotal  mechanism 

An  article  in  the  Davis  Enterprise  (the  local  newspaper  in  Davis,  California)  on 
January  12,  2001  started  with  the  following  paragraph: 

By  consensus,  the  Davis  City  Council  agreed  Wednesday  to 
order  a communitywide  public  opinion  poll  to  gauge  how 
much  Davis  residents  would  be  willing  to  pay  for  a park  tax 
and  a public  safety  tax. 

Opinion  polls  of  this  type  are  worthwhile  only  if  there  are  reasons  to  believe  that 
the  people  who  are  interviewed  will  respond  honestly.  But  will  they?  If  I would  like 
more  parks  and  believe  that  the  final  tax  I will  have  to  pay  is  independent  of  the 
amount  I state  in  the  interview,  I would  have  an  incentive  to  overstate  my 
willingness  to  pay,  hoping  to  swing  the  decision  towards  building  a new  park.  On 
the  other  hand,  if  I fear  that  the  final  tax  will  be  affected  by  the  amount  I report, 
then  I might  have  an  incentive  to  understate  my  willingness  to  pay. 

The  pivotal  mechanism,  or  Clarke  mechanism,  is  a game  designed  to  give  the 
participants  an  incentive  to  report  their  true  willingness  to  pay. 

A public project, say to build a park, is under consideration. The cost of the project
is $C. There are n individuals in the community. If the project is carried out,
individual i (i = 1, ..., n) will have to pay $c_i (with \(c_1 + c_2 + \dots + c_n = C\)); these
amounts are specified as part of the project. Note that we allow for the possibility
that some individuals might have to contribute a larger share of the total cost C
than others (e.g. because they live closer to the projected park and would therefore
benefit more from it). Individual i has an initial wealth of $m_i > 0. If the project is
carried out, individual i receives benefits from it that she considers equivalent to
receiving $v_i. Note that for some individual i, v_i could be negative, that is, the
individual could be harmed by the project (e.g. because she likes peace and quiet and
a park close to her home would bring extra traffic and noise). We assume that
individual i has the following utility-of-wealth function:


\[ U_i(\$m) = \begin{cases} m & \text{if the project is not carried out} \\ m + v_i & \text{if the project is carried out} \end{cases} \]


The socially efficient decision is to carry out the project if and only if \(\sum_{i=1}^{n} v_i > C\)
(recall that \(\sum\) is the summation sign: \(\sum_{i=1}^{n} v_i\) is a short-hand for \(v_1 + v_2 + \dots + v_n\)).

For example, suppose that n = 2, \(m_1 = 50\), \(m_2 = 60\), \(v_1 = 19\), \(v_2 = -15\), C = 6,
\(c_1 = 6\), \(c_2 = 0\). In this case \(\sum_{i=1}^{n} v_i = 19 - 15 = 4 < C = 6\), hence the project should
not be carried out. To see this consider the following table.


                              If the project is     If the project is
                              not carried out       carried out
Utility of individual 1             50              50 + 19 - 6 = 63
Utility of individual 2             60              60 - 15 = 45


If  the  project  is  carried  out,  Individual  1 has  a utility  gain  of  13,  while  Individual  2 
has  a utility  loss  of  15.  Since  the  loss  is  greater  than  the  gain,  we  have  a Pareto 
inefficient  situation.  Individual  2 could  propose  the  following  alternative  to 
Individual 1: let us not carry out the project and I will pay you $14. Then
Individual 1’s wealth and utility would be 50 + 14 = 64 and Individual 2’s wealth
and utility would be 60 - 14 = 46 and thus they would both be better off.

Thus Pareto efficiency requires that the project be carried out if and only if
\(\sum_{i=1}^{n} v_i > C\). This would be a simple decision for the government if it knew the \(v_i\)’s.
But, typically, these values are private information to the individuals. Can the
government find a way to induce the individuals to reveal their true valuations? It
seems that in general the answer is No: those who gain from the project would have
an incentive to overstate their potential gains, while those who suffer would have
an incentive to overstate their potential losses. Influenced by Vickrey’s work on
second-price auctions, Clarke suggested the following procedure or game. Each
individual i is asked to submit a number \(w_i\) which will be interpreted as the gross
benefit (if positive) or harm (if negative) that individual i associates with the project.
Note that, in principle, individual i can lie and report a value \(w_i\) which is different
from the true value \(v_i\). Then the decision will be:


Carry out the project?

\[ \text{Yes if } \sum_{j=1}^{n} w_j > C, \qquad \text{No if } \sum_{j=1}^{n} w_j \le C. \]

However,  this  is  not  the  end  of  the  story.  Each  individual  will  be  classified  as  either 
not  pivotal  or  pivotal. 


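Individual i is not pivotal if

\[ \text{either} \quad \left( \sum_{j=1}^{n} w_j > C \ \text{ and } \ \sum_{j \ne i} w_j > \sum_{j \ne i} c_j \right) \]

\[ \text{or} \quad \left( \sum_{j=1}^{n} w_j \le C \ \text{ and } \ \sum_{j \ne i} w_j \le \sum_{j \ne i} c_j \right) \]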


and she is pivotal otherwise. In other words individual i is pivotal if the
decision about the project that would be made in the restricted society resulting
from removing individual i is different from the decision that is made when she
is included. If an individual is not pivotal then she has to pay no taxes. If
individual i is pivotal then she has to pay a tax in the amount of
\(\left| \sum_{j \ne i} w_j - \sum_{j \ne i} c_j \right|\), the absolute value of \(\sum_{j \ne i} w_j - \sum_{j \ne i} c_j\)
(recall that the absolute value of a is equal to a, if a is positive, and to -a, if a is
negative; for instance, |4| = 4 and |-4| = -(-4) = 4).


For example, let n = 3, C = 10, \(c_1 = 3\), \(c_2 = 2\), \(c_3 = 5\). Suppose that they state the
following benefits/losses (which may or may not be the true ones): \(w_1 = -1\),
\(w_2 = 8\), \(w_3 = 3\). Then \(\sum_{i=1}^{3} w_i = 10 = C\). Thus the project will not be carried out.
Who is pivotal? The answer is provided in the following table.
Individual   Σ w_j         Σ c_j         Decision   Σ w_j, j ≠ i    Σ c_j, j ≠ i    Decision   Pivotal?   Tax
             (including i) (including i)            (without i)     (without i)

    1            10            10          No       8 + 3 = 11      2 + 5 = 7         Yes        Yes      11 - 7 = 4
    2            10            10          No       -1 + 3 = 2      3 + 5 = 8         No         No       0
    3            10            10          No       -1 + 8 = 7      3 + 2 = 5         Yes        Yes      7 - 5 = 2

It may seem that, since it involves paying a tax, being pivotal is a bad thing and one
should try to avoid it. It is certainly possible for individual i to make sure that she
is not pivotal: all she has to do is to report \(w_i = c_i\); in fact, if \(\sum_{j \ne i} w_j > \sum_{j \ne i} c_j\), then
adding \(c_i\) to both sides yields \(\sum_{j=1}^{n} w_j > C\) and if \(\sum_{j \ne i} w_j \le \sum_{j \ne i} c_j\) then adding \(c_i\) to
both sides yields \(\sum_{j=1}^{n} w_j \le C\). It is not true, however, that it is best to avoid being
pivotal. The following example shows that one can gain by being truthful even if it
involves being pivotal and thus having to pay a tax. Let n = 4, C = 15, \(c_1 = 5\),
\(c_2 = 0\), \(c_3 = 5\) and \(c_4 = 5\). Suppose that \(m_1 = 40\) and \(v_1 = 25\). Imagine that you
are individual 1 and, for whatever reason, you expect the following reports by the
other individuals: \(w_2 = -40\), \(w_3 = 15\) and \(w_4 = 20\). If you report \(w_1 = c_1 = 5\) then
you ensure that you are not pivotal. In this case
\(\sum_{j=1}^{4} w_j = 5 - 40 + 15 + 20 = 0 < C = 15\) and thus the project is not carried out and
your utility is equal to \(m_1 = 40\). If you report truthfully, that is, you report
\(w_1 = v_1 = 25\) then \(\sum_{j=1}^{4} w_j = 25 - 40 + 15 + 20 = 20 > C = 15\) and the project is carried
out; furthermore, you are pivotal and have to pay a tax \(t_1\) equal to
\(\left| \sum_{j=2}^{4} w_j - \sum_{j=2}^{4} c_j \right| = |(-40 + 15 + 20) - (0 + 5 + 5)| = |-15| = 15\)
and your utility will be \(m_1 + v_1 - c_1 - t_1 = 40 + 25 - 5 - 15 = 45\); hence you are better off.
Indeed, the following theorem states
that no individual can ever gain by lying.
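
The comparison in this example can be replayed for any report by individual 1. The Python sketch below is illustrative: the constants and the function utility_of_individual_1 are assumed names, and the utility when the project is not carried out is taken to be initial wealth minus any tax owed, which follows from the description of the mechanism rather than from an explicit formula in the text.

```python
COSTS = [5, 0, 5, 5]    # c_1, ..., c_4 (total cost C = 15)
OTHERS = [-40, 15, 20]  # expected reports w_2, w_3, w_4
M1, V1 = 40, 25         # individual 1's initial wealth and true valuation


def utility_of_individual_1(w1):
    """Utility of individual 1 when she reports w1 and the others report OTHERS."""
    carried_out = w1 + sum(OTHERS) > sum(COSTS)
    w_rest, c_rest = sum(OTHERS), sum(COSTS[1:])
    pivotal = (w_rest > c_rest) != carried_out
    tax = abs(w_rest - c_rest) if pivotal else 0
    if carried_out:
        return M1 + V1 - COSTS[0] - tax
    return M1 - tax


if __name__ == "__main__":
    print(utility_of_individual_1(5))   # report w_1 = c_1: not pivotal
    print(utility_of_individual_1(25))  # truthful report: pivotal, pays the tax
```

Against these particular reports, no integer report between -50 and 100 gives individual 1 more than the truthful report does, which is consistent with the theorem that follows.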


Theorem 1.2 [Clarke, 1971]. In the pivotal mechanism (under the assumed
preferences) truthful revelation (that is, stating \(w_i = v_i\)) is a weakly dominant
strategy for every Player i.

The proof of Theorem 1.2 is given in Appendix 1.A at the end of this chapter.

This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this section, by going through the exercises in Section 1.E.4 of Appendix 1.E at the
end  of  this  chapter. 


1.5.  Iterated  deletion  procedures 

If  in  a game  a player  has  a (weakly  or  strictly)  dominant  strategy  then  the  player 
ought  to  choose  that  strategy:  in  the  case  of  strict  dominance  choosing  any  other 
strategy  guarantees  that  the  player  will  do  worse  and  in  the  case  of  weak  dominance 
no  other  strategy  can  give  a better  outcome,  no  matter  what  the  other  players  do. 
Unfortunately,  games  that  have  a dominant-strategy  equilibrium  are  not  very 
common.  What  should  a player  do  when  she  does  not  have  a dominant  strategy? 
We  shall  consider  two  iterative  deletion  procedures  that  can  help  solve  some  games. 

1.5.1. IDSDS. The Iterated Deletion of Strictly Dominated Strategies (IDSDS) is the
following procedure or algorithm. Given a finite ordinal strategic-form game G, let
\(G^1\) be the game obtained by removing from G, for every Player i, those strategies of
Player i (if any) that are strictly dominated in G by some other strategy; let \(G^2\) be
the game obtained by removing from \(G^1\), for every Player i, those strategies of
Player i (if any) that are strictly dominated in \(G^1\) by some other strategy, and so on.
Let \(G^\infty\) be the output of this procedure. Since the initial game G is finite, \(G^\infty\) will
be obtained in a finite number of steps. Figure 1.10 illustrates this procedure. If
\(G^\infty\) contains a single strategy profile (this is not the case in the example of Figure
1.10) then we call that strategy profile the iterated strict dominant-strategy
equilibrium. If \(G^\infty\) contains two or more strategy profiles then we refer to those
strategy profiles merely as the output of the IDSDS procedure. For example, in the
game of Figure 1.10 the output of the IDSDS procedure is the set of strategy
profiles {(A, e), (A, f), (B, e), (B, f)}.

What is the significance of the output of the IDSDS procedure? Consider game G of
Figure 1.10. Since, for Player 2, h is strictly dominated by g, if Player 2 is rational
she will not play h. Thus, if Player 1 believes that Player 2 is rational then he
believes that Player 2 will not play h, that is, he restricts attention to game $G^1$;
since, in $G^1$, D is strictly dominated by C for Player 1, if Player 1 is rational he will
not play D. It follows that if Player 2 believes that Player 1 is rational and that
Player 1 believes that Player 2 is rational, then Player 2 restricts attention to game
$G^2$ where rationality requires that Player 2 not play g, etc. It will be shown in a
later chapter that if the players are rational and there is common belief of
rationality¹¹ then only strategy profiles that survive the IDSDS procedure can be
played; the converse is also true: any strategy profile that survives the IDSDS
procedure is compatible with common belief of rationality.


                       Player 2
                  e      f      g      h
            A    6,3    4,4    4,1    3,0
Player 1    B    5,4    6,3    0,2    5,1
            C    5,0    3,2    6,1    4,0
            D    2,0    2,3    3,3    6,1

                      $G = G^0$

            delete h (dominated by g)

                       Player 2
                  e      f      g
            A    6,3    4,4    4,1
Player 1    B    5,4    6,3    0,2
            C    5,0    3,2    6,1
            D    2,0    2,3    3,3

                        $G^1$

            delete D (dominated by C)

                       Player 2
                  e      f      g
            A    6,3    4,4    4,1
Player 1    B    5,4    6,3    0,2
            C    5,0    3,2    6,1

                        $G^2$

            delete g (dominated by f)

                       Player 2
                  e      f
            A    6,3    4,4
Player 1    B    5,4    6,3
            C    5,0    3,2

                        $G^3$

            delete C (dominated by A)

                       Player 2
                  e      f
Player 1    A    6,3    4,4
            B    5,4    6,3

                    $G^4 = G^\infty$

Figure 1.10


Remark 1.5. In finite games, the order in which strictly dominated strategies are
deleted  is  irrelevant,  in  the  sense  that  any  sequence  of  deletions  of  strictly 
dominated  strategies  leads  to  the  same  output. 
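
The IDSDS procedure lends itself to a short program. The following Python sketch is only an illustration, not part of the formal development: the payoff table is that of game G in Figure 1.10, while the function names (`payoff`, `strictly_dominated`, `idsds`) are hypothetical names chosen here. At every step it deletes, for both players, all the strategies that are strictly dominated in the current game.

```python
from typing import Dict, List, Tuple

# Game G of Figure 1.10: (row, column) -> (payoff of Player 1, payoff of Player 2).
PAYOFFS: Dict[Tuple[str, str], Tuple[int, int]] = {
    ("A", "e"): (6, 3), ("A", "f"): (4, 4), ("A", "g"): (4, 1), ("A", "h"): (3, 0),
    ("B", "e"): (5, 4), ("B", "f"): (6, 3), ("B", "g"): (0, 2), ("B", "h"): (5, 1),
    ("C", "e"): (5, 0), ("C", "f"): (3, 2), ("C", "g"): (6, 1), ("C", "h"): (4, 0),
    ("D", "e"): (2, 0), ("D", "f"): (2, 3), ("D", "g"): (3, 3), ("D", "h"): (6, 1),
}


def payoff(player: int, own: str, other: str) -> int:
    """Payoff of `player` (0 = row player, 1 = column player)."""
    cell = (own, other) if player == 0 else (other, own)
    return PAYOFFS[cell][player]


def strictly_dominated(player: int, own: List[str], others: List[str]) -> List[str]:
    """Strategies in `own` that are strictly dominated by another strategy in `own`."""
    return [
        s for s in own
        if any(all(payoff(player, t, o) > payoff(player, s, o) for o in others)
               for t in own if t != s)
    ]


def idsds(rows: List[str], cols: List[str]) -> Tuple[List[str], List[str]]:
    """Repeat the deletion step until no strictly dominated strategy is left."""
    while True:
        bad_rows = strictly_dominated(0, rows, cols)
        bad_cols = strictly_dominated(1, cols, rows)
        if not bad_rows and not bad_cols:
            return rows, cols
        rows = [r for r in rows if r not in bad_rows]
        cols = [c for c in cols if c not in bad_cols]


print(idsds(["A", "B", "C", "D"], ["e", "f", "g", "h"]))
# -> (['A', 'B'], ['e', 'f'])
```

The surviving strategies are A and B for Player 1 and e and f for Player 2, which matches the output {(A,e), (A,f), (B,e), (B,f)} stated above.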


11  An  event  E is  commonly  believed  if  everybody  believes  E and  everybody  believes  that 
everybody  believes  E and  everybody  believes  that  everybody  believes  that  everybody  believes 
E,  and  so  on. 

1.5.2. IDWDS. The Iterated Deletion of Weakly Dominated Strategies (IDWDS) is a
weakening of IDSDS in that it allows the deletion also of weakly dominated
strategies. However, this procedure has to be defined carefully, since in this case the
order of deletion can matter. To see this, consider the game shown in Figure 1.11.


                  Player 2
                  L      R
            A    4,0    0,0
Player 1    T    3,2    2,2
            M    1,1    0,0
            B    0,0    1,1

Figure 1.11

Since M is strictly dominated by T for Player 1, we can delete it and obtain

                  Player 2
                  L      R
            A    4,0    0,0
Player 1    T    3,2    2,2
            B    0,0    1,1

Now L is weakly dominated by R for Player 2. Deleting L we are left with

                  Player 2
                  R
            A    0,0
Player 1    T    2,2
            B    1,1

Now A and B are strictly dominated by T. Deleting them we are left with
(T,R) with corresponding payoffs (2,2).

Alternatively, going back to the game of Figure 1.11, we could note that B is
strictly dominated by T; deleting B we are left with

                  Player 2
                  L      R
            A    4,0    0,0
Player 1    T    3,2    2,2
            M    1,1    0,0

Now R is weakly dominated by L for Player 2. Deleting R we are left with

                  Player 2
                  L
            A    4,0
Player 1    T    3,2
            M    1,1

Now T and M are strictly dominated by A and deleting them leads to (A,L)
with corresponding payoffs (4,0). Since one order of deletion leads to (T,R) with
payoffs (2,2) and the other to (A,L) with payoffs (4,0), the procedure is not well
defined: the output of a well-defined procedure should be unique.


Definition  1.8  (IDWDS).  In  order  to  avoid  the  problem  illustrated  above,  the 
IDWDS  procedure  is  defined  as  follows:  at  every  step  identify,  for  every  player,  all  the 
strategies  that  are  weakly  (or  strictly)  dominated  and  then  delete  all  such  strategies  in  that 
step.  If  the  output  of  the  IDWDS  procedure  is  a single  strategy  profile  then  we  call 
that  strategy  profile  the  iterated  weak  dominant-strategy  equilibrium  (otherwise  we 
just  use  the  expression  ‘output  of  the  IDWDS  procedure’). 


For example, the IDWDS procedure when applied to the game of Figure 1.11 leads
to the following output:

                  Player 2
                  L      R
Player 1    A    4,0    0,0
            T    3,2    2,2

Hence the game of Figure 1.11 does not have an iterated weak dominant-strategy
equilibrium. 
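
Definition 1.8 can be checked mechanically on the game of Figure 1.11. The Python sketch below is an illustration under stated assumptions: the payoffs are those of Figure 1.11, and the helper names (`weakly_dominated`, `idwds`) are hypothetical. At every step it identifies all the weakly (or strictly) dominated strategies of both players in the current game and deletes them simultaneously.

```python
from typing import Dict, List, Tuple

# Game of Figure 1.11: (row, column) -> (payoff of Player 1, payoff of Player 2).
GAME: Dict[Tuple[str, str], Tuple[int, int]] = {
    ("A", "L"): (4, 0), ("A", "R"): (0, 0),
    ("T", "L"): (3, 2), ("T", "R"): (2, 2),
    ("M", "L"): (1, 1), ("M", "R"): (0, 0),
    ("B", "L"): (0, 0), ("B", "R"): (1, 1),
}


def payoff(player: int, own: str, other: str) -> int:
    """Payoff of `player` (0 = row player, 1 = column player)."""
    cell = (own, other) if player == 0 else (other, own)
    return GAME[cell][player]


def weakly_dominated(player: int, own: List[str], others: List[str]) -> List[str]:
    """Strategies in `own` that are weakly (or strictly) dominated in the current game."""
    def dominates(t: str, s: str) -> bool:
        diffs = [payoff(player, t, o) - payoff(player, s, o) for o in others]
        # never worse, and strictly better against at least one opposing strategy
        return all(d >= 0 for d in diffs) and any(d > 0 for d in diffs)

    return [s for s in own if any(dominates(t, s) for t in own if t != s)]


def idwds(rows: List[str], cols: List[str]) -> Tuple[List[str], List[str]]:
    """Delete all dominated strategies of both players at every step."""
    while True:
        bad_rows = weakly_dominated(0, rows, cols)
        bad_cols = weakly_dominated(1, cols, rows)
        if not bad_rows and not bad_cols:
            return rows, cols
        rows = [r for r in rows if r not in bad_rows]
        cols = [c for c in cols if c not in bad_cols]


print(idwds(["A", "T", "M", "B"], ["L", "R"]))
# -> (['A', 'T'], ['L', 'R'])
```

In the first step only M and B are deleted (neither L nor R is dominated in the full game); in the reduced game L and R give Player 2 identical payoffs, so nothing more can be deleted.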

The  interpretation  of  the  output  of  the  IDWDS  procedure  is  not  as  simple  as  that 
of  the  IDSDS  procedure:  certainly  common  belief  of  rationality  is  not  sufficient.  In 
order  to  delete  weakly  dominated  strategies  one  needs  to  appeal  not  only  to 
rationality  but  also  to  some  notion  of  caution:  a player  should  not  completely  rule 
out  any  of  her  opponents’  strategies.  However,  this  notion  of  caution  is  in  direct 
conflict  with  the  process  of  deletion  of  strategies.  In  this  book  we  shall  not  address 
the  issue  of  how  to  justify  the  IDWDS  procedure. 

This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this section, by going through the exercises in Section 1.E.5 of Appendix 1.E at the
end  of  this  chapter. 


1.6.  Nash  equilibrium 

Games  where  either  the  IDSDS  procedure  or  the  IDWDS  procedure  leads  to  a 
unique  strategy  profile  are  not  very  common.  How  can  one  then  “solve”  games  that 
are  not  solved  by  either  procedure?  The  notion  of  Nash  equilibrium  offers  a more 
general  alternative.  We  first  define  Nash  equilibrium  for  a two-player  game. 

Definition 1.9. Given an ordinal game in strategic form with two players, a
strategy profile $s^* = (s_1^*, s_2^*) \in S_1 \times S_2$ is a Nash equilibrium if the following two
conditions are satisfied:

(1) for every $s_1 \in S_1$, $\pi_1(s_1^*, s_2^*) \ge \pi_1(s_1, s_2^*)$ (or stated in terms of outcomes and
preferences, $f(s_1^*, s_2^*) \succsim_1 f(s_1, s_2^*)$) and

(2) for every $s_2 \in S_2$, $\pi_2(s_1^*, s_2^*) \ge \pi_2(s_1^*, s_2)$ (or, $f(s_1^*, s_2^*) \succsim_2 f(s_1^*, s_2)$).

For example, in the game of Figure 1.12 there are two Nash equilibria: (T,L) and
(B,C).

                  Player 2
                  L      C      R
            T    3,2    0,0    1,1
Player 1    M    3,0    1,5    4,4
            B    1,0    2,3    3,0

Figure 1.12

There  are  several  possible  interpretations  of  this  definition. 

‘No regret’ interpretation: $s^*$ is a Nash equilibrium if there is no player who, after
observing the opponent’s choice, regrets his own choice (in the sense that he could
have done better with a different strategy of his, given the observed strategy of the
opponent).

‘Self-enforcing agreement’ interpretation: imagine that the players are able to
communicate before playing the game and reach a non-binding agreement
expressed as a strategy profile $s^*$; then no player will have an incentive to deviate
from the agreement, if she believes that the other player will follow the agreement,
if and only if $s^*$ is a Nash equilibrium.

‘Viable recommendation’ interpretation: imagine that a third party makes a
public recommendation to each player on what strategy to play; then no player will
have an incentive to deviate from the recommendation, if she believes that the other
players will follow the recommendation, if and only if the recommended strategy
profile is a Nash equilibrium.

‘Transparency of reason’ interpretation: if players are all “equally rational” and
Player 2 reaches the conclusion that she should play y, then Player 1 must be able to
duplicate Player 2’s reasoning process and come to the same conclusion; it follows
that Player 1’s choice of strategy is not rational unless it is a strategy x that is
optimal against y. A similar argument applies to Player 2’s choice of strategy (y must
be optimal against x) and thus (x,y) is a Nash equilibrium.

It is clear that all of the above interpretations are mere rewordings of the formal
definition of Nash equilibrium in terms of the inequalities of Definition 1.9.

The generalization of Definition 1.9 to games with more than two players is
straightforward.

Definition 1.10. Given an ordinal game in strategic form with n players, a strategy
profile $s^* \in S$ is a Nash equilibrium if the following inequalities are satisfied:

for every Player i, $\pi_i(s^*) \ge \pi_i(s_1^*, \dots, s_{i-1}^*, s_i, s_{i+1}^*, \dots, s_n^*)$ for all $s_i \in S_i$.

The reader should convince himself/herself that a (weak or strict) dominant
strategy equilibrium is a Nash equilibrium and the same is true of a (weak or strict)
iterated dominant-strategy equilibrium.

Definition 1.11. Consider an ordinal game in strategic form, a Player i and a
strategy profile $\bar{s}_{-i} \in S_{-i}$ of the players other than i. A strategy $s_i \in S_i$ of Player i is
a best reply (or best response) to $\bar{s}_{-i}$ if $\pi_i(s_i, \bar{s}_{-i}) \ge \pi_i(s_i', \bar{s}_{-i})$ for every $s_i' \in S_i$.

For example, in the game of Figure 1.12, reproduced below, for Player 1 there are
two best replies to L, namely M and T, while the unique best reply to C is B and the
unique best reply to R is M; for Player 2 the best reply to T is L, the best reply to M
is C and the best reply to B is C.

                  Player 2
                  L      C      R
            T    3,2    0,0    1,1
Player 1    M    3,0    1,5    4,4
            B    1,0    2,3    3,0


Remark 1.6. Using the notion of best reply, an alternative definition of Nash
equilibrium is as follows: $s^* \in S$ is a Nash equilibrium if and only if, for every
Player i, $s_i^* \in S_i$ is a best reply to $s_{-i}^* \in S_{-i}$.

A quick  way  to  find  the  Nash  equilibria  of  a two-player  game  is  as  follows:  in  each 
column  of  the  table  underline  the  largest  payoff  of  Player  1 in  that  column  (if  there 
are  several  instances,  underline  them  all)  and  in  each  row  underline  the  largest 
payoff  of  Player  2 in  that  row;  if  a cell  has  both  payoffs  underlined  then  the 
corresponding  strategy  profile  is  a Nash  equilibrium.  Underlining  of  the  maximum 
payoff  of  Player  1 in  a given  column  identifies  the  best  reply  of  Player  1 to  the 
strategy  of  Player  2 that  labels  that  column  and  similarly  for  Player  2.  This 
procedure is illustrated in Figure 1.13, where there is a unique Nash equilibrium,
namely (B,E).

                       Player 2
                  E      F      G      H
            A    4,0    3,2    2,3    4,1
Player 1    B    4,2    2,1    1,2    0,2
            C    3,6    5,5    3,1    5,0
            D    2,3    3,2    1,2    3,3

Figure 1.13

Exercise 1.3 in Appendix 1.E explains how to represent a three-player game by
means of a set of tables. In a three-player game the procedure for finding the Nash
equilibria is the same, with the necessary adaptation for Player 3: in each cell
underline the payoff of Player 3 if and only if her payoff is the largest of all her
payoffs in the same cell across different tables. This is illustrated in Figure 1.14,
where there is a unique Nash equilibrium, namely (B,R,W).

                  Player 2                                Player 2
                  L        R                              L        R
Player 1    T   0,0,0    2,8,6          Player 1    T   0,0,0    1,2,5
            B   5,3,2    3,4,2                      B   1,6,1    0,0,1

            Player 3 chooses W                      Player 3 chooses E

Figure  1.14 
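
The underlining procedure for two-player games translates directly into code. The Python sketch below is an illustration only: the payoffs are those of Figure 1.12, and the function name `pure_nash_equilibria` is a hypothetical name chosen here. It records the cells where Player 1's payoff is the largest in its column and the cells where Player 2's payoff is the largest in its row, and returns the cells that are marked for both players.

```python
from typing import Dict, List, Tuple

Cell = Tuple[str, str]

# Game of Figure 1.12: (row, column) -> (payoff of Player 1, payoff of Player 2).
GAME: Dict[Cell, Tuple[int, int]] = {
    ("T", "L"): (3, 2), ("T", "C"): (0, 0), ("T", "R"): (1, 1),
    ("M", "L"): (3, 0), ("M", "C"): (1, 5), ("M", "R"): (4, 4),
    ("B", "L"): (1, 0), ("B", "C"): (2, 3), ("B", "R"): (3, 0),
}


def pure_nash_equilibria(game: Dict[Cell, Tuple[int, int]]) -> List[Cell]:
    rows = list(dict.fromkeys(r for r, _ in game))   # row labels, in table order
    cols = list(dict.fromkeys(c for _, c in game))   # column labels, in table order
    # "Underline" the largest payoff of Player 1 in each column (ties included).
    best1 = {(r, c) for c in cols for r in rows
             if game[(r, c)][0] == max(game[(x, c)][0] for x in rows)}
    # "Underline" the largest payoff of Player 2 in each row (ties included).
    best2 = {(r, c) for r in rows for c in cols
             if game[(r, c)][1] == max(game[(r, y)][1] for y in cols)}
    # A cell with both payoffs underlined is a Nash equilibrium.
    return [(r, c) for r in rows for c in cols
            if (r, c) in best1 and (r, c) in best2]


print(pure_nash_equilibria(GAME))
# -> [('T', 'L'), ('B', 'C')]
```

The same function can be applied to any two-player table entered in the same dictionary format.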


Unfortunately,  when  the  game  has  too  many  players  or  too  many  strategies  and  it 
is  thus  impossible  or  impractical  to  represent  it  as  a set  of  tables,  there  is  no  quick 
procedure  for  finding  the  Nash  equilibria:  one  must  simply  apply  the  definition  of 
Nash  equilibrium.  For  example,  consider  the  following  game. 

Example 1.1. There are 50 players. A benefactor asks them to simultaneously and
secretly  write  on  a piece  of  paper  a request,  which  must  be  a multiple  of  $10  up  to  a 
maximum  of  $100  (thus  the  possible  strategies  of  each  player  are  $10,  $20,  ...,  $90, 
$100).  He  will  then  proceed  as  follows:  if  not  more  than  10%  of  the  players  ask  for 
$100  then  he  will  grant  every  player’s  request,  otherwise  every  player  will  get 
nothing.  Assume  that  every  player  is  selfish  and  greedy  (only  cares  about  how 
much  money  she  gets  and  prefers  more  money  to  less).  What  are  the  Nash 
equilibria  of  this  game?  There  are  several: 

• every  strategy  profile  where  7 or  more  players  request  $100  is  a Nash 
equilibrium  (everybody  gets  nothing  and  no  player  can  get  a positive 
amount  by  unilaterally  changing  her  request,  since  there  will  still  be  more 
than  10%  requesting  $100;  on  the  other  hand,  convince  yourself  that  a 
strategy  profile  where  exactly  6 players  request  $100  is  not  a Nash 
equilibrium), 

• every  strategy  profile  where  exactly  5 players  request  $100  and  the 
remaining  players  request  $90  is  a Nash  equilibrium. 

Any  other  strategy  profile  is  not  a Nash  equilibrium:  (1)  if  fewer  than  5 players 
request  $100,  then  a player  who  requested  less  than  $100  can  increase  her  payoff  by 
switching  to  a request  of  $100,  (2)  if  exactly  5 players  request  $100  and  among  the 
remaining  players  there  is  one  who  is  not  requesting  $90,  then  that  player  can 
increase  her  payoff  by  increasing  her  request  to  $90. 
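
Applying the definition directly is easy to automate for a given strategy profile. The Python sketch below is an illustration of Example 1.1, with hypothetical function names (`payoffs`, `is_nash`): it computes what each player receives and then tries every unilateral deviation of every player.

```python
from typing import List

REQUESTS = range(10, 101, 10)  # admissible requests: $10, $20, ..., $100


def payoffs(profile: List[int]) -> List[int]:
    """Each player gets her request unless more than 10% of the players ask for $100."""
    too_many = 10 * profile.count(100) > len(profile)
    return [0 if too_many else request for request in profile]


def is_nash(profile: List[int]) -> bool:
    """True if no player can increase her payoff by unilaterally changing her request."""
    base = payoffs(profile)
    for i in range(len(profile)):
        for request in REQUESTS:
            deviation = profile[:i] + [request] + profile[i + 1:]
            if payoffs(deviation)[i] > base[i]:
                return False
    return True


print(is_nash([100] * 7 + [10] * 43))   # 7 players request $100
print(is_nash([100] * 6 + [10] * 44))   # exactly 6 players request $100
print(is_nash([100] * 5 + [90] * 45))   # exactly 5 request $100, the rest $90
```

The three calls correspond to the cases discussed above: the first and third profiles pass the test, while in the second a player who requested $100 gains by switching to $90.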

We  conclude  this  section  by  noting  that,  since  so  far  we  have  restricted  attention  to 
ordinal  games,  there  is  no  guarantee  that  an  arbitrary  game  will  have  at  least  one 
Nash  equilibrium.  An  example  of  a game  that  has  no  Nash  equilibria  is  the 
Matching  Penny  game.  This  is  a simultaneous  two-player  game  where  each  player 
has  a coin  and  decides  whether  to  show  the  Heads  face  or  the  Tails  face.  If  both 
choose  H or  both  choose  T then  Player  1 wins,  otherwise  Player  2 wins.  Each 
player  strictly  prefers  the  outcome  where  she  herself  wins  to  the  alternative 
outcome.  The  game  is  illustrated  in  Figure  1.15. 


                  Player 2
                  H      T
Player 1    H    1,0    0,1
            T    0,1    1,0

Figure 1.15

This is a good time to test your understanding of the concepts introduced in
this section, by going through the exercises in Section 1.E.6 of Appendix 1.E at the
end  of  this  chapter. 


1.7.  Games  with  infinite  strategy  sets 

Games  where  the  strategy  set  of  one  or  more  players  is  infinite  cannot  be 
represented  using  a table  or  set  of  tables.  However,  all  the  concepts  introduced  in 
this  chapter  can  still  be  applied.  In  this  section  we  will  focus  on  the  notion  of  Nash 
equilibrium.  We  start  with  an  example. 


Example 1.2. There are two players. Each player has to write down a real number
(not necessarily an integer) greater than or equal to 1; thus the strategy sets are
$S_1 = S_2 = [1, \infty)$. Payoffs are as follows ($\pi_1$ is the payoff of Player 1, $\pi_2$ the payoff of
Player 2, x is the number written by Player 1 and y the number written by Player
2):

$\pi_1(x,y) = \begin{cases} x - 1 & \text{if } x < y \\ 0 & \text{if } x \ge y \end{cases}$        $\pi_2(x,y) = \begin{cases} y - 1 & \text{if } x > y \\ 0 & \text{if } x \le y \end{cases}$


What  are  the  Nash  equilibria  of  this  game? 

There is only one Nash equilibrium, namely (1,1) with payoffs (0,0). First of all, we
must show that (1,1) is indeed a Nash equilibrium. If Player 1 unilaterally switched to
some x > 1 then her payoff would remain 0 and the same is true for Player 2 if he
unilaterally switched to some y > 1: $\pi_1(x,1) = 0$, for all $x \in [1, \infty)$ and $\pi_2(1,y) = 0$,
for all $y \in [1, \infty)$. Now we show that no other pair (x,y) is a Nash equilibrium.
Consider first an arbitrary pair (x,y) with x = y > 1. Then $\pi_1(x,y) = 0$, but if Player
1 switched to an $\hat{x}$ strictly between 1 and x ($1 < \hat{x} < x$) her payoff would be
$\pi_1(\hat{x},y) = \hat{x} - 1 > 0$ (recall that, by hypothesis, x = y). Now consider an arbitrary
(x,y) with x < y. Then $\pi_1(x,y) = x - 1$, but if Player 1 switched to an $\hat{x}$ strictly
between x and y ($x < \hat{x} < y$) her payoff would be $\pi_1(\hat{x},y) = \hat{x} - 1 > x - 1$. The
argument for ruling out pairs (x,y) with y < x is similar.

Note the interesting fact that, for Player 1, x = 1 is a weakly dominated strategy;
indeed, it is weakly dominated by any other strategy: x = 1 guarantees a payoff of 0
for Player 1, while any x > 1 would yield a positive payoff to Player 1 in some cases
(against any y > x) and 0 in the remaining cases. The same is true for Player 2.
Thus in this game there is a unique Nash equilibrium where the strategy of each player
is weakly dominated.
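
The deviations used in the argument above can be written out explicitly. The Python sketch below is an illustration only; the function name `profitable_deviation` and the choice of the midpoint as the deviating number are assumptions made here (any number strictly between the two bounds would do). For a pair (x,y) other than (1,1) it returns the player who can gain and a number that makes her strictly better off.

```python
from typing import Optional, Tuple


def pi1(x: float, y: float) -> float:
    """Payoff of Player 1 in Example 1.2."""
    return x - 1 if x < y else 0.0


def pi2(x: float, y: float) -> float:
    """Payoff of Player 2 in Example 1.2."""
    return y - 1 if x > y else 0.0


def profitable_deviation(x: float, y: float) -> Optional[Tuple[int, float]]:
    """Return (player, new number) for a profitable unilateral deviation, or None at (1,1)."""
    if x == 1 and y == 1:
        return None                                   # every deviation still yields 0
    if x == y:
        x_hat = (1 + x) / 2                           # strictly between 1 and x
        assert pi1(x_hat, y) > pi1(x, y)
        return (1, x_hat)
    if x < y:
        x_hat = (x + y) / 2                           # strictly between x and y
        assert pi1(x_hat, y) > pi1(x, y)
        return (1, x_hat)
    y_hat = (x + y) / 2                               # case y < x: Player 2 deviates
    assert pi2(x, y_hat) > pi2(x, y)
    return (2, y_hat)


print(profitable_deviation(1.0, 1.0))   # None
print(profitable_deviation(3.0, 3.0))   # (1, 2.0)
print(profitable_deviation(2.0, 4.0))   # (1, 3.0)
print(profitable_deviation(4.0, 2.0))   # (2, 3.0)
```

A search over a finite grid of numbers would not be a substitute for this argument: on a grid there may be no admissible number strictly between 1 and x, so pairs such as x = y slightly above 1 could wrongly appear to be equilibria.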

[Note: the rest of this section makes use of calculus. The reader who is not familiar
with  calculus  should  skip  this  part.] 

We conclude this section with an example based on the analysis of competition
among firms proposed by Augustin Cournot in a book published in 1838. In fact,
Cournot is the one who invented what we now call Nash equilibrium, although his
analysis was restricted to a small class of games. Consider $n \ge 2$ firms which produce
an identical product. Let $q_i$ be the quantity produced by Firm i (i = 1, ..., n). For
Firm i the cost of producing $q_i$ units of output is $c_i q_i$, where $c_i$ is a positive
constant. For simplicity we will restrict attention to the case of two firms (n = 2)
and identical cost functions: $c_1 = c_2 = c$. Let Q be total industry output, that is,
$Q = q_1 + q_2$. The price at which each firm can sell each unit of output is given by
the inverse demand function $P = a - bQ$ where a and b are positive constants.
Cournot assumed that each firm was only interested in its own profit and preferred
higher profit to lower profit (that is, each firm is “selfish and greedy”). The profit
function of Firm 1 is given by

$\pi_1(q_1,q_2) = Pq_1 - cq_1 = [a - b(q_1 + q_2)]q_1 - cq_1 = (a-c)q_1 - b(q_1)^2 - bq_1q_2$.

Similarly, the profit function of Firm 2 is given by

$\pi_2(q_1,q_2) = (a-c)q_2 - b(q_2)^2 - bq_1q_2$.

Cournot defined an equilibrium as a pair $(\bar{q}_1, \bar{q}_2)$ such that

$\pi_1(\bar{q}_1, \bar{q}_2) \ge \pi_1(q_1, \bar{q}_2)$, for every $q_1 \ge 0$        (♣)

and

$\pi_2(\bar{q}_1, \bar{q}_2) \ge \pi_2(\bar{q}_1, q_2)$, for every $q_2 \ge 0$        (♦)

Of course, this is the same as saying that $(\bar{q}_1, \bar{q}_2)$ is a Nash equilibrium of the game
where the players are the two firms, the strategy sets are $S_1 = S_2 = [0, \infty)$ and the
payoff functions are the profit functions. How do we find a Nash equilibrium?
First of all, note that the profit functions are differentiable. Secondly note that (♣)
says that, having fixed the value of $q_2$ at $\bar{q}_2$, the function $\pi_1(q_1, \bar{q}_2)$ viewed as a
function of $q_1$ alone is maximized at the point $q_1 = \bar{q}_1$. A necessary condition for
this (if $\bar{q}_1 > 0$) is that the derivative of this function be zero at the point $q_1 = \bar{q}_1$,
that is, it must be that $\frac{\partial \pi_1}{\partial q_1}(\bar{q}_1, \bar{q}_2) = 0$. This condition is also sufficient since the
second derivative of this function is always negative ($\frac{\partial^2 \pi_1}{\partial q_1^2}(q_1,q_2) = -2b$ for every
$(q_1,q_2)$). Similarly, by (♦), it must be that $\frac{\partial \pi_2}{\partial q_2}(\bar{q}_1, \bar{q}_2) = 0$. Thus the Nash
equilibrium is found by solving the system of two equations

$a - c - 2bq_1 - bq_2 = 0$
$a - c - 2bq_2 - bq_1 = 0$

The solution is $\bar{q}_1 = \bar{q}_2 = \frac{a-c}{3b}$. The corresponding price is
$\bar{P} = a - b\left(2\,\frac{a-c}{3b}\right) = \frac{a+2c}{3}$ and the corresponding profits are
$\pi_1(\bar{q}_1, \bar{q}_2) = \pi_2(\bar{q}_1, \bar{q}_2) = \frac{(a-c)^2}{9b}$. For example, if a = 25, b = 2, c = 1 then the
Nash equilibrium is given by (4,4) with corresponding profits of 32 for each firm.
The  analysis  can  easily  be  extended  to  the  case  of  more  than  two  firms.  The  reader 
who  is  interested  in  further  exploring  the  topic  of  competition  among  firms  can 
consult  any  textbook  on  Industrial  Organization. 
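
The numerical example can be verified with a few lines of code. The Python sketch below is an illustration under the assumptions of this section (two firms, linear inverse demand, identical constant unit cost); the function names `profit`, `best_reply` and `cournot_equilibrium` are hypothetical. The best reply is obtained by solving the first-order condition for the firm's own output, truncated at zero.

```python
from typing import Tuple


def profit(q_own: float, q_other: float, a: float, b: float, c: float) -> float:
    """Profit (a - c) q_own - b q_own^2 - b q_own q_other of a firm."""
    return (a - c) * q_own - b * q_own ** 2 - b * q_own * q_other


def best_reply(q_other: float, a: float, b: float, c: float) -> float:
    """Output solving a - c - 2 b q_own - b q_other = 0 (or 0 if that is negative)."""
    return max(0.0, (a - c - b * q_other) / (2 * b))


def cournot_equilibrium(a: float, b: float, c: float) -> Tuple[float, float]:
    """Solution of the two first-order conditions: each firm produces (a - c) / (3b)."""
    q = (a - c) / (3 * b)
    return q, q


a, b, c = 25.0, 2.0, 1.0
q1, q2 = cournot_equilibrium(a, b, c)
print(q1, q2)                          # 4.0 4.0
print(a - b * (q1 + q2))               # price: 9.0
print(profit(q1, q2, a, b, c))         # 32.0
print(best_reply(q2, a, b, c) == q1)   # each output is a best reply to the other: True
```

At the equilibrium each firm's output is a best reply to the other firm's output, which is exactly the pair of conditions (♣) and (♦).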


This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this section, by going through the exercises in Section 1.E.7 of Appendix 1.E at the
end  of  this  chapter. 


Appendix 1.A: Proofs of theorems

Theorem 1.1 [Vickrey, 1961]. In a second-price auction, if Player i is selfish and
greedy then it is a weakly dominant strategy for Player i to bid her true value, that
is, to choose $b_i = v_i$.

Proof. In order to make the notation simpler and the argument more transparent,
we give the proof for the case where n = 2. We shall prove that bidding $v_1$ is a
weakly dominant strategy for Player 1 (the proof for Player 2 is similar). Assume
that Player 1 is selfish and greedy. Then we can take her payoff function to be as
follows:

$\pi_1(b_1,b_2) = \begin{cases} v_1 - b_2 & \text{if } b_1 \ge b_2 \\ 0 & \text{if } b_1 < b_2 \end{cases}$

We need to show that, whatever bid Player 2 submits, Player 1 cannot get a higher
payoff by submitting a bid different from $v_1$. Two cases are possible (recall that $b_2$
denotes the actual bid of Player 2, which is unknown to Player 1).

Case 1: $b_2 \le v_1$. In this case, bidding $v_1$ makes Player 1 the winner and her payoff is
$v_1 - b_2 \ge 0$. Consider a different bid $b_1$. If $b_1 \ge b_2$ then Player 1 is still the winner
and her payoff is still $v_1 - b_2 \ge 0$. Thus such a bid is as good as (hence not better
than) $v_1$. If $b_1 < b_2$ then the winner is Player 2 and Player 1 gets a payoff of 0. Thus
such a bid is also not better than $v_1$.

Case 2: $b_2 > v_1$. In this case, bidding $v_1$ makes Player 2 the winner and thus Player
1 gets a payoff of 0. Any other bid $b_1 < b_2$ gives the same outcome and payoff. On
the other hand, any bid $b_1 \ge b_2$ makes Player 1 the winner, giving her a payoff of
$v_1 - b_2 < 0$, thus making Player 1 worse off than with a bid of $v_1$. ■
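
The two cases of the proof can be spot-checked by brute force. The Python sketch below is an illustration only, not a replacement for the proof: it restricts bids and values to the integers 0 to 10 (an assumption made here), uses the payoff function of Player 1 given above (ties are won by Player 1), and checks that no bid ever gives Player 1 more than bidding her true value. The function names are hypothetical.

```python
def payoff1(v1: int, b1: int, b2: int) -> int:
    """Player 1's payoff in the two-bidder second-price auction (Player 1 wins ties)."""
    return v1 - b2 if b1 >= b2 else 0


BIDS = range(0, 11)  # illustrative grid of admissible bids and values


def truthful_never_worse(v1: int) -> bool:
    """True if, against every bid of Player 2, bidding v1 is at least as good as any other bid."""
    return all(payoff1(v1, v1, b2) >= payoff1(v1, b1, b2)
               for b1 in BIDS for b2 in BIDS)


print(all(truthful_never_worse(v1) for v1 in BIDS))   # True
print(payoff1(5, 5, 7), payoff1(5, 8, 7))             # 0 -2: overbidding can hurt
```

The last line is an instance of Case 2: with a value of 5 and an opposing bid of 7, bidding 8 wins the object at a price above its value.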


Theorem 1.2 [Clarke, 1971]. In the pivotal mechanism (under the assumed
preferences) truthful revelation (that is, stating $w_i = v_i$) is a weakly dominant
strategy for every Player i.

Proof. Consider an individual i and possible statements $w_j$ for $j \ne i$. Several cases are
possible.

Case 1: $\sum_{j \ne i} w_j > \sum_{j \ne i} c_j$ and $v_i + \sum_{j \ne i} w_j > c_i + \sum_{j \ne i} c_j = C$. Then

                                                        | decision | i's tax | i's utility
if i states $v_i$                                       | Yes | 0 | $m_i + v_i - c_i$
if i states $w_i$ such that $w_i + \sum_{j \ne i} w_j > C$   | Yes | 0 | $m_i + v_i - c_i$
if i states $w_i$ such that $w_i + \sum_{j \ne i} w_j \le C$ | No | $\sum_{j \ne i} w_j - \sum_{j \ne i} c_j$ | $m_i - \left( \sum_{j \ne i} w_j - \sum_{j \ne i} c_j \right)$

Individual i cannot gain by lying if and only if $m_i + v_i - c_i \ge m_i - \left( \sum_{j \ne i} w_j - \sum_{j \ne i} c_j \right)$,
i.e. if $v_i + \sum_{j \ne i} w_j \ge C$, which is true by our hypothesis.


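Case 2: $\sum_{j \ne i} w_j > \sum_{j \ne i} c_j$ and $v_i + \sum_{j \ne i} w_j \le c_i + \sum_{j \ne i} c_j = C$. Then

                                                        | decision | i's tax | i's utility
if i states $v_i$                                       | No | $\sum_{j \ne i} w_j - \sum_{j \ne i} c_j$ | $m_i - \left( \sum_{j \ne i} w_j - \sum_{j \ne i} c_j \right)$
if i states $w_i$ such that $w_i + \sum_{j \ne i} w_j \le C$ | No | $\sum_{j \ne i} w_j - \sum_{j \ne i} c_j$ | $m_i - \left( \sum_{j \ne i} w_j - \sum_{j \ne i} c_j \right)$
if i states $w_i$ such that $w_i + \sum_{j \ne i} w_j > C$   | Yes | 0 | $m_i + v_i - c_i$

Individual i cannot gain by lying if and only if $m_i - \left( \sum_{j \ne i} w_j - \sum_{j \ne i} c_j \right) \ge m_i + v_i - c_i$,
i.e. if $v_i + \sum_{j \ne i} w_j \le C$, which is true by our hypothesis.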

Case 3: $\sum_{j \ne i} w_j \le \sum_{j \ne i} c_j$ and $v_i + \sum_{j \ne i} w_j \le c_i + \sum_{j \ne i} c_j = C$. Then

                                                        | decision | i's tax | i's utility
if i states $v_i$                                       | No | 0 | $m_i$
if i states $w_i$ such that $w_i + \sum_{j \ne i} w_j \le C$ | No | 0 | $m_i$
if i states $w_i$ such that $w_i + \sum_{j \ne i} w_j > C$   | Yes | $\sum_{j \ne i} c_j - \sum_{j \ne i} w_j$ (recall that $\sum_{j \ne i} w_j \le \sum_{j \ne i} c_j$) | $m_i + v_i - c_i - \left( \sum_{j \ne i} c_j - \sum_{j \ne i} w_j \right)$

Individual i cannot gain by lying if and only if $m_i \ge m_i + v_i - c_i - \left( \sum_{j \ne i} c_j - \sum_{j \ne i} w_j \right)$,
i.e. if $v_i + \sum_{j \ne i} w_j \le C$, which is our hypothesis.


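Case 4: $\sum_{j \ne i} w_j \le \sum_{j \ne i} c_j$ and $v_i + \sum_{j \ne i} w_j > c_i + \sum_{j \ne i} c_j = C$. Then

                                                        | decision | i's tax | i's utility
if i states $v_i$                                       | Yes | $\sum_{j \ne i} c_j - \sum_{j \ne i} w_j$ (recall that $\sum_{j \ne i} w_j \le \sum_{j \ne i} c_j$) | $m_i + v_i - c_i - \left( \sum_{j \ne i} c_j - \sum_{j \ne i} w_j \right)$
if i states $w_i$ such that $w_i + \sum_{j \ne i} w_j > C$   | Yes | $\sum_{j \ne i} c_j - \sum_{j \ne i} w_j$ (recall that $\sum_{j \ne i} w_j \le \sum_{j \ne i} c_j$) | $m_i + v_i - c_i - \left( \sum_{j \ne i} c_j - \sum_{j \ne i} w_j \right)$
if i states $w_i$ such that $w_i + \sum_{j \ne i} w_j \le C$ | No | 0 | $m_i$

Individual i cannot gain by lying if and only if $m_i + v_i - c_i - \left( \sum_{j \ne i} c_j - \sum_{j \ne i} w_j \right) \ge m_i$,
i.e. if $v_i + \sum_{j \ne i} w_j \ge C$, which is true by our hypothesis.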


Since  we  have  covered  all  the  possible  cases,  the  proof  is  complete. 
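
The four cases can also be spot-checked numerically. The Python sketch below is an illustration under stated assumptions: the decision and tax rules are read off the tables in the proof (the project is carried out if and only if the sum of the stated values exceeds C, and an individual pays a tax only when her statement changes the decision that the others' statements alone would produce); the cost shares (2, 3, 4), the initial wealth of 10 and the integer grid of values are arbitrary choices made here, and the function names are hypothetical.

```python
from itertools import product
from typing import Sequence, Tuple


def utility(i: int, w: Tuple[int, ...], v_i: int, m_i: int, c: Sequence[int]) -> int:
    """Utility of individual i with true value v_i when the stated values are w."""
    others_w = sum(w) - w[i]
    others_c = sum(c) - c[i]
    yes = sum(w) > sum(c)                       # decision with i's statement
    yes_without_i = others_w > others_c         # decision based on the others alone
    tax = abs(others_w - others_c) if yes != yes_without_i else 0
    return m_i + (v_i - c[i] if yes else 0) - tax


def truthful_never_worse(c: Sequence[int], grid: Sequence[int], m_i: int = 10) -> bool:
    """True if no individual can ever gain by stating a value different from v_i."""
    n = len(c)
    for i in range(n):
        for v_i in grid:
            for others in product(grid, repeat=n - 1):
                truthful = others[:i] + (v_i,) + others[i:]
                u_truth = utility(i, truthful, v_i, m_i, c)
                for lie in grid:
                    stated = others[:i] + (lie,) + others[i:]
                    if utility(i, stated, v_i, m_i, c) > u_truth:
                        return False
    return True


print(truthful_never_worse(c=(2, 3, 4), grid=range(0, 9)))   # True
```

For instance, with cost shares (2, 3, 4), true value 6 for the first individual and statements (1, 1) by the others, stating 8 instead of 6 turns the decision into Yes but triggers a tax of 5, leaving her with 1 unit less than under truthful revelation.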


Appendix 1.E: Exercises

1.E.1. Exercises for Section 1.1: Game frames and games

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this  chapter. 

Exercise  1.1.  Antonia  and  Bob  cannot  decide  where  to  go  to  dinner.  Antonia 
proposes  the  following  procedure:  she  will  write  on  a piece  of  paper  either  the 
number  2 or  the  number  4 or  the  number  6,  while  Bob  will  write  on  his  piece  of 
paper  either  the  number  1 or  3 or  5.  They  will  write  their  numbers  secretly  and 
independently.  They  then  will  show  each  other  what  they  wrote  and  choose  a 
restaurant  according  to  the  following  rule:  if  the  sum  of  the  two  numbers  is  5 or 
less,  they  will  go  to  a Mexican  restaurant,  if  the  sum  is  7 they  will  go  to  an  Italian 
restaurant and if the sum is 9 or more they will go to a Japanese restaurant.

(a)  Let  Antonia  be  Player  1 and  Bob  Player  2.  Represent  this  situation  as  a game 
frame,  first  by  writing  out  each  element  of  the  quadruple  of  Definition  1.1  and 
then  by  using  a table  (label  the  rows  with  Antonia’s  strategies  and  the  columns 
with  Bob’s  strategies,  so  that  we  can  think  of  Antonia  as  choosing  the  row  and 
Bob  as  choosing  the  column). 

(b) Suppose that Antonia and Bob have the following preferences (where M stands
for ‘Mexican’, I for ‘Italian’ and J for ‘Japanese’). For Antonia:
$M \succ_{Antonia} I \succ_{Antonia} J$; for Bob: $I \succ_{Bob} M \succ_{Bob} J$. Using utility functions with
values 1, 2 and 3 represent the corresponding reduced-form game as a table.

Exercise  1.2.  Consider  the  following  two-player  game-frame  where  each  player  is 
given  a set  of  cards  and  each  card  has  a number  on  it.  The  players  are  Antonia 
(Player  1)  and  Bob  (Player  2).  Antonia’s  cards  have  the  following  numbers  (one 
number  on  each  card):  2,  4 and  6,  whereas  Bob’s  cards  are  marked  0,  1 and  2 (thus 
different  numbers  from  the  previous  exercise).  Antonia  chooses  one  of  her  own 
cards  and  Bob  chooses  one  of  his  own  cards:  this  is  done  without  knowing  the 
other  player’s  choice.  The  outcome  depends  on  the  sum  of  the  points  of  the  chosen 
cards,  as  follows.  If  the  sum  of  the  points  on  the  two  chosen  cards  is  greater  than  or 
equal  to  5,  Antonia  gets  $10  minus  that  sum;  otherwise  (that  is,  if  the  sum  is  less 
than  5)  she  gets  nothing;  furthermore,  if  the  sum  of  points  is  an  odd  number,  Bob 
gets  as  many  dollars  as  that  sum;  if  the  sum  of  points  turns  out  to  be  an  even 
number  and  is  less  than  or  equal  to  6,  Bob  gets  $2;  otherwise  he  gets  nothing. 

(a)  Represent  the  game-frame  described  above  by  means  of  a table.  As  in  the 
previous  exercise,  assign  the  rows  to  Antonia  and  the  columns  to  Bob. 

(b)  Using  the  game-frame  of  part  (a)  obtain  a reduced-form  game  by  adding  the 
information  that  each  player  is  selfish  and  greedy.  This  means  that  each  player 
only cares about how much money he/she gets and prefers more money to less.

Exercise 1.3. Alice (Player 1), Bob (Player 2), and Charlie (Player 3) play the
following simultaneous game. They are sitting in different rooms facing a keyboard
with only one key and each has to decide whether or not to press the key. Alice
wins if the number of people who press the key is odd (that is, all three of them or
only Alice or only Bob or only Charlie), Bob wins if exactly two people (he may
be one of them) press the key and Charlie wins if nobody presses the key.

(a) Represent this situation as a game-frame. Note that we can represent a three-
player game with a set of tables: Player 1 chooses the row, Player 2 chooses the
column and Player 3 chooses the table (that is, we label the rows with Player 1’s
strategies, the columns with Player 2’s strategies and the tables with Player 3’s
strategies).

(b) Using the game-frame of part (a) obtain a reduced-form game by adding the
information that each player prefers winning to not winning and is indifferent
between any two outcomes where he/she does not win. For each player use a
utility function with values from the set {0,1}.

(c)  Using  the  game-frame  of  part  (a)  obtain  a reduced-form  game  by  adding  the 
information  that  (1)  each  player  prefers  winning  to  not  winning,  (2)  Alice  is 
indifferent  between  any  two  outcomes  where  she  does  not  win,  (3)  conditional 
on  not  winning,  Bob  prefers  if  Charlie  wins  rather  than  Alice,  (4)  conditional 
on  not  winning,  Charlie  prefers  if  Bob  wins  rather  than  Alice.  For  each  player 
use  a utility  function  with  values  from  the  set  {0,1,2}. 

1.E.2. Exercises for Section 1.2: Strict/weak dominance

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this  chapter. 

Exercise  1.4.  There  are  two  players.  Each  player  is  given  an  unmarked  envelope 
and  asked  to  put  in  it  either  nothing  or  $300  of  his  own  money  or  $600  of  his  own 
money.  A referee  collects  the  envelopes,  opens  them,  gathers  all  the  money,  then 
adds  50%  of  that  amount  (using  his  own  money)  and  divides  the  total  into  two 
equal  parts  which  he  then  distributes  to  the  players. 

(a)  Represent  this  game  frame  with  two  alternative  tables:  the  first  table  showing  in 
each  cell  the  amount  of  money  distributed  to  Player  1 and  the  amount  of 
money  distributed  to  Player  2,  the  second  table  showing  the  change  in  wealth  of 
each  player  (money  received  minus  contribution). 

(b)  Suppose  that  Player  1 has  some  animosity  towards  the  referee  and  ranks  the 
outcomes  in  terms  of  how  much  money  the  referee  loses  (the  more,  the  better), 
while  Player  2 is  selfish  and  greedy  and  ranks  the  outcomes  in  terms  of  her  own 
net  gain.  Represent  the  corresponding  game  using  a table. 

(c)  Is  there  a strict  dominant-strategy  equilibrium? 

Exercise 1.5. (a) For the game of Part (b) of Exercise 1.1 determine, for each player,
whether the player has strictly dominated strategies.

(b)  For  the  game  of  Part  (b)  of  Exercise  1.1  determine,  for  each  player,  whether  the 
player  has  weakly  dominated  strategies. 

Exercise  1.6.  There  are  three  players.  Each  player  is  given  an  unmarked  envelope 
and  asked  to  put  in  it  either  nothing  or  $3  of  his  own  money  or  $6  of  his  own 
money.  A referee  collects  the  envelopes,  opens  them,  gathers  all  the  money  and 
then  doubles  the  amount  (using  his  own  money)  and  divides  the  total  into  three 
equal  parts  which  he  then  distributes  to  the  players.  For  example,  if  Players  1 and  2 
put  nothing  and  Player  3 puts  $6,  then  the  referee  adds  another  $6  so  that  the  total 
becomes  $12,  divides  this  sum  into  three  equal  parts  and  gives  $4  to  each  player. 
Each  player  is  selfish  and  greedy,  in  the  sense  that  he  ranks  the  outcomes 
exclusively  in  terms  of  his  net  change  in  wealth  (what  he  gets  from  the  referee 
minus  what  he  contributed). 

(a)  Represent  this  game  by  means  of  a set  of  tables.  (Do  not  treat  the  referee  as  a 
player.) 

(b)  For  each  player  and  each  pair  of  strategies  determine  if  one  of  the  two 
dominates  the  other  and  specify  if  it  is  weak  or  strict  dominance. 

(c)  Is  there  a strict  dominant-strategy  equilibrium? 

1.E.3. Exercises for Section 1.3: Second price auction

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this  chapter. 

Exercise  1.7.  For  the  second-price  auction  partially  illustrated  in  Table  1.9, 
complete  the  representation  by  adding  the  payoffs  of  Player  2,  assuming  that  Player 
2 assigns  a value  of  $50M  to  the  field  and,  like  Player  1,  ranks  the  outcomes  in  terms 
of  the  net  gain  from  the  oil  field  (defined  as  profits  minus  the  price  paid,  if  Player  2 
wins,  and  zero  otherwise). 

Exercise 1.8. Consider the following “third-price” auction. There are \(n \ge 3\) bidders.
A single object is auctioned and Player \(i\) values the object \(\$v_i\), with \(v_i > 0\). The bids
are simultaneous and secret. The utility of Player \(i\) is: 0 if she does not win and
\((v_i - p)\) if she wins and pays \(\$p\). Every non-negative number is an admissible bid.
Let \(b_i\) denote the bid of Player \(i\). The winner is the highest bidder. In case of ties
the bidder with the lowest index among those who submitted the highest bid wins
(e.g. if the highest bid is $120 and it is submitted by players 6, 12 and 15, then the
winner is Player 6). The losers don’t get anything and don’t pay anything. The
winner gets the object and pays the third highest bid, which is defined as follows.
Let \(i\) be the winner and fix a Player \(j\) such that
\(b_j = \max(\{b_1, \dots, b_n\} \setminus \{b_i\})\) [if more than one player other than \(i\)
submitted the bid \(\max(\{b_1, \dots, b_n\} \setminus \{b_i\})\), then we pick one of them].
Then the third price is defined as \(\max(\{b_1, \dots, b_n\} \setminus \{b_i, b_j\})\). For example, if \(n = 3\)
and the bids are \(b_1 = 30\), \(b_2 = 40\) and \(b_3 = 40\) then the winner is Player 2 and she
pays $30; if \(b_1 = b_2 = b_3 = 50\) then the winner is Player 1 and she pays $50.

For simplicity, let us restrict attention to the case where \(n = 3\) and \(v_1 > v_2 > v_3 > 0\).
Does Player 1 have a weakly dominant strategy in this auction?

1.E.4. Exercises for Section 1.4: The pivotal mechanism

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this  chapter. 

Exercise  1.9.  The  pivotal  mechanism  is  used  to  decide  whether  a new  park  should 
be  built.  There  are  5 individuals.  According  to  the  proposed  project,  the  cost  of  the 
park  would  be  allocated  as  follows: 


individual       1            2            3            4            5
share of cost    c_1 = $30    c_2 = $25    c_3 = $25    c_4 = $15    c_5 = $5

For every individual \(i = 1, \dots, 5\), let \(v_i\) be the perceived gross benefit (if positive;
perceived gross loss, if negative) from having the park built. The \(v_i\)’s are as follows:

individual       1            2            3            4             5
gross benefit    v_1 = $60    v_2 = $15    v_3 = $55    v_4 = -$25    v_5 = -$20

(Thus the net benefit (loss) to individual \(i\) is \(v_i - c_i\).) Individual \(i\) has the
following utility of wealth function (where \(m_i\) denotes the wealth of individual
\(i\)):

\[
\begin{cases}
m_i & \text{if the project is not carried out} \\
m_i + v_i & \text{if the project is carried out}
\end{cases}
\]

(Let \(\bar{m}_i\) be the initial endowment of money of individual \(i\) and assume that \(\bar{m}_i\)
is large enough that it exceeds \(c_i\) plus any tax that the individual might have to
pay.)

(a)  What  is  the  Pareto-efficient  decision:  to  build  the  park  or  not? 

Assume that the pivotal mechanism is used, so that each individual \(i\) is asked to
state a number \(w_i\) which is going to be interpreted as the gross benefit to
individual \(i\) from carrying out the project. There are no restrictions on the
number \(w_i\): it can be positive, negative or zero. Suppose that the individuals
make the following announcements:

individual        1            2            3            4             5
stated benefit    w_1 = $70    w_2 = $10    w_3 = $65    w_4 = -$30    w_5 = $5

(b)  Would  the  park  be  built  based  on  the  above  announcements? 

(c)  Using  the  above  announcements  and  the  rules  of  the  pivotal  mechanism,  fill 
in  the  following  table: 


individual    1    2    3    4    5
Pivotal?
Tax

(d)  As  you  know,  in  the  pivotal  mechanism  each  individual  has  a dominant 
strategy.  If  all  the  individuals  played  their  dominant  strategies,  would  the 
park  be  built? 

(e)  Assuming  that  all  the  individuals  play  their  dominant  strategies,  find  out 
who  is  pivotal  and  what  tax  (if  any)  each  individual  has  to  pay? 

(f)  Show  that  if  every  other  individual  reports  his/her  true  benefit,  then  it  is 
best  for  Individual  1 to  also  report  his/her  true  benefit. 


1.E.5. Exercises for Section 1.5: Iterated deletion procedures

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this  chapter. 

Exercise  1.10.  (a)  Apply  the  IDSDS  procedure  (Iterated  Deletion  of  Strictly 
Dominated  Strategies)  to  the  game  of  Part  (b)  of  Exercise  1.1. 

(b)  Apply  the  IDWDS  procedure  (Iterated  Deletion  of  Weakly  Dominated 
Strategies)  to  the  game  of  Part  (b)  of  Exercise  1.1. 

Exercise 1.11. Apply the IDSDS procedure to the following game. Is there a strict
iterated dominant-strategy equilibrium?

                   Player 2
               d       e       f
Player 1  a   8, 6    0, 9    3, 8
          b   3, 2    2, 1    4, 3
          c   2, 8    1, 5    3, 1

Exercise  1.12.  Consider  the  following  game.  There  is  a club  with  three  members: 
Ann,  Bob  and  Carla.  They  have  to  choose  which  of  the  three  is  going  to  be 
president  next  year.  Currently,  Ann  is  the  president.  Each  member  is  both  a 
candidate  and  a voter.  Voting  is  as  follows:  each  member  votes  for  one  candidate 
(voting  for  oneself  is  allowed);  if  two  or  more  people  vote  for  the  same  candidate 
then  that  person  is  chosen  as  the  next  president;  if  there  is  complete  disagreement, 
in  the  sense  that  there  is  exactly  one  vote  for  each  candidate,  then  the  person  for 
whom  Ann  voted  is  selected  as  the  next  president. 

(a)  Represent  this  voting  procedure  as  a game  frame,  indicating  inside  each  cell  of 
each  table  which  candidate  is  elected. 

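(b) Assume that the players’ preferences are as follows:
\(\text{Ann} \succ_{\text{Ann}} \text{Carla} \succ_{\text{Ann}} \text{Bob}\),
\(\text{Carla} \succ_{\text{Bob}} \text{Bob} \succ_{\text{Bob}} \text{Ann}\),
\(\text{Bob} \succ_{\text{Carla}} \text{Ann} \succ_{\text{Carla}} \text{Carla}\).

Using utility values 0, 1 and 2, convert the game frame into a game.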

(c)  Apply  the  IDWDS  to  the  game  of  part  (b).  Is  there  an  iterated  weak  dominant- 
strategy  equilibrium? 

(d)  Does  the  extra  power  given  to  Ann  (in  the  form  of  tie-breaking  in  case  of 
complete  disagreement)  benefit  Ann? 

Exercise  1.13.  Consider  the  following  game: 

[Payoff table not recoverable from the source: Player 1 chooses a, b or c; Player 2 chooses D, E or F.]

(a)  Apply  the  IDSDS  procedure  to  it.  Is  there  an  iterated  strict  dominant-strategy 
equilibrium? 

(b)  Apply  the  IDWDS  procedure  to  it.  Is  there  an  iterated  weak  dominant-strategy 
equilibrium? 

1.E.6. Exercises for Section 1.6: Nash equilibrium

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this  chapter. 

Exercise  1.14.  Find  the  Nash  equilibria  of  the  game  of  Exercise  1.2. 

Exercise  1.15.  Find  the  Nash  equilibria  of  the  games  of  Exercise  1.3  (b)  and  (c). 

Exercise  1.16.  Find  the  Nash  equilibria  of  the  games  of  Exercise  1.4  (b). 

Exercise  1.17.  Find  the  Nash  equilibria  of  the  game  of  Exercise  1.6. 

Exercise  1.18.  Find  the  Nash  equilibria  of  the  game  of  Exercise  1.7. 

Exercise 1.19. Find one Nash equilibrium of the game of Exercise 1.8 for the case
where \(n = 3\) and \(v_1 > v_2 > v_3 > 0\).

Exercise  1.20.  Find  the  Nash  equilibria  of  the  game  of  part  (b)  of  Exercise  1.12. 
Exercise  1.21.  Find  the  Nash  equilibria  of  the  game  of  Exercise  1.13. 

1.E.7. Exercises for Section 1.6: Games with infinite strategy sets

Exercise 1.22. Consider a simultaneous \(n\)-player game where each player \(i\) chooses
an effort level \(a_i \in [0,1]\). The payoff to player \(i\) is given by
\(\pi_i(a_1, \dots, a_n) = 4 \min\{a_1, \dots, a_n\} - 2a_i\) (efforts are complementary and each player’s
cost per unit of effort is 2).

(a)  Find  all  the  Nash  equilibria  and  prove  that  they  are  indeed  Nash  equilibria. 

(b)  Are  any  of  the  Nash  equilibria  Pareto  efficient? 

(c)  Find  a Nash  equilibrium  where  each  player  gets  a payoff  of  1. 

Exercise 1.23: Challenging Question. The Mondevil Corporation operates a
chemical plant, which is located on the banks of the Sacramento river. Downstream
from the chemical plant is a group of fisheries. The Mondevil plant emits
by-products that pollute the river, causing harm to the fisheries. The profit Mondevil
obtains from operating the chemical plant is \(\$\Pi > 0\). The harm inflicted on the
fisheries due to water pollution is equal to \(\$L > 0\) of lost profit [without pollution
the fisheries’ profit is \(\$A\), while with pollution it is \(\$(A - L)\)]. Suppose that the
fisheries collectively sue the Mondevil Corporation. It is easily verified in court that
Mondevil's plant pollutes the river. However, the values of \(\Pi\) and \(L\) cannot be
verified by the court, although they are commonly known to the litigants. Suppose
that the court requires the Mondevil attorney (Player 1) and the fisheries’ attorney
(Player 2) to play the following litigation game. Player 1 is asked to announce a
number \(x \ge 0\), which the court interprets as a claim about the plant’s profits. Player
2 is asked to announce a number \(y \ge 0\), which the court interprets as the fisheries'
claim about their profit loss. The announcements are made simultaneously and
independently. Then the court uses Posner's nuisance rule to make its decision (R.
Posner, Economic analysis of Law, 9th edition, 1997). According to the rule, if
\(y > x\), then Mondevil must shut down its chemical plant. If \(x \ge y\), then the court
allows Mondevil to operate the plant, but the court also requires Mondevil to pay
the fisheries the amount \(y\). Note that the court cannot force the attorneys to tell the
truth: in fact, it would not be able to tell whether or not the lawyers were reporting
truthfully. Assume that the attorneys want to maximize the payoff (profits) of their
clients.

(a)  Represent  this  situation  as  a normal-form  game  by  describing  the  strategy 
set  of  each  player  and  the  payoff  functions. 

(b) Is it a dominant strategy for the Mondevil attorney to make a truthful
announcement (i.e. to choose \(x = \Pi\))? [Prove your claim.]

(c) Is it a dominant strategy for the fisheries’ attorney to make a truthful
announcement (i.e. to choose \(y = L\))? [Prove your claim.]

(d) For the case where \(\Pi > L\) (recall that \(\Pi\) and \(L\) denote the true amounts),
find all the Nash equilibria of the litigation game. [Prove that what you
claim to be Nash equilibria are indeed Nash equilibria and that there are no
other Nash equilibria.]

(e) For the case where \(\Pi < L\) (recall that \(\Pi\) and \(L\) denote the true amounts),
find all the Nash equilibria of the litigation game. [Prove that what you
claim to be Nash equilibria are indeed Nash equilibria and that there are no
other Nash equilibria.]

(f)  Does  the  court  rule  give  rise  to  a Pareto  efficient  outcome?  [Assume  that  the 
players  end  up  playing  a Nash  equilibrium.] 




GAME  THEORY  - Giacomo  Bonanno 


Appendix 1.S: Solutions to exercises

Exercise 1.1. (a) \(I = \{1, 2\}\), \(S_1 = \{2, 4, 6\}\), \(S_2 = \{1, 3, 5\}\), \(O = \{M, I, J\}\) (where M
stands for ‘Mexican’, I for ‘Italian’ and J for ‘Japanese’); the set of strategy
profiles is \(S = \{(2,1), (2,3), (2,5), (4,1), (4,3), (4,5), (6,1), (6,3), (6,5)\}\); the
outcome function is \(f(2,1) = f(2,3) = f(4,1) = M\), \(f(2,5) = f(4,3) = f(6,1) = I\)
and \(f(4,5) = f(6,3) = f(6,5) = J\). The representation as a table is
as follows:

                     Player 2 (Bob)
                     1     3     5
Player 1       2     M     M     I
(Antonia)      4     M     I     J
               6     I     J     J

(b) Using values 1, 2 and 3, the utility functions are as follows, where \(U_1\) is the
utility function of Player 1 (Antonia) and \(U_2\) is the utility function of Player 2
(Bob):

          M     I     J
U_1:      3     2     1
U_2:      2     3     1

The reduced-form game is as follows:

                     Player 2 (Bob)
                     1       3       5
Player 1       2    3, 2    3, 2    2, 3
(Antonia)      4    3, 2    2, 3    1, 1
               6    2, 3    1, 1    1, 1

Exercise 1.2.

(a) The game-frame is as follows:

                                           Player 2 (Bob)
               0                           1                           2
Player 1   2   Antonia nothing, Bob $2     Antonia nothing, Bob $3     Antonia nothing, Bob $2
(Antonia)  4   Antonia nothing, Bob $2     Antonia $5, Bob $5          Antonia $4, Bob $2
           6   Antonia $4, Bob $2          Antonia $3, Bob $7          Antonia $2, Bob nothing

(b) When the outcomes are sums of money and Player \(i\) is selfish and greedy
then we can take the following as \(i\)’s utility function: \(U_i(\$x) = x\) (other utility
functions would do too: the only requirement is that the utility of a larger sum
of money is larger than the utility of a smaller sum of money). Thus the
reduced-form game is as follows:

                     Player 2 (Bob)
                     0       1       2
Player 1       2    0, 2    0, 3    0, 2
(Antonia)      4    0, 2    5, 5    4, 2
               6    4, 2    3, 7    2, 0

Exercise 1.3.

(a) The game-frame is as follows:

P = Press

                          Bob
                    P              not P
Alice   P        Alice wins      Bob wins
        not P    Bob wins        Alice wins
                     Charlie: P

                          Bob
                    P              not P
Alice   P        Bob wins        Alice wins
        not P    Alice wins      Charlie wins
                     Charlie: not P

(b) The reduced-form game is as follows (in each cell the payoffs are listed in the
order Alice, Bob, Charlie):

P = Press

                          Bob
                    P              not P
Alice   P        1, 0, 0         0, 1, 0
        not P    0, 1, 0         1, 0, 0
                     Charlie: P

                          Bob
                    P              not P
Alice   P        0, 1, 0         1, 0, 0
        not P    1, 0, 0         0, 0, 1
                     Charlie: not P

(c) The reduced-form game is as follows. For Alice we chose 1 and 0 as utilities,
but one could also use 2 and 1 or 2 and 0.

P = Press

                          Bob
                    P              not P
Alice   P        1, 0, 0         0, 2, 1
        not P    0, 2, 1         1, 0, 0
                     Charlie: P

                          Bob
                    P              not P
Alice   P        0, 2, 1         1, 0, 0
        not P    1, 0, 0         0, 1, 2
                     Charlie: not P

Exercise 1.4. (a) The tables are as follows:

Distributed money

                         Player 2
                  0           300          600
Player 1   0      0, 0        225, 225     450, 450
           300    225, 225    450, 450     675, 675
           600    450, 450    675, 675     900, 900

Net amounts

                         Player 2
                  0           300          600
Player 1   0      0, 0        225, -75     450, -150
           300    -75, 225    150, 150     375, 75
           600    -150, 450   75, 375      300, 300

(b) For Player 1 we can take as his payoff the total money lost by the referee
and for Player 2 her own net gain:

                         Player 2
                  0           300          600
Player 1   0      0, 0        150, -75     300, -150
           300    150, 225    300, 150     450, 75
           600    300, 450    450, 375     600, 300

(c) For Player 1 contributing $600 is a strictly dominant strategy and for Player
2 contributing $0 is a strictly dominant strategy. Thus ($600, $0) is the strict
dominant-strategy equilibrium.
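
The dominance claim of part (c) can be checked mechanically. The following is a minimal Python sketch, not part of the original solution; the function names `payoffs` and `strictly_dominant` are illustrative choices. It rebuilds the payoffs of part (b) from the rules of the game and searches each player's strategy set for a strategy that is strictly better than every alternative against every choice of the opponent.

```python
CONTRIBUTIONS = (0, 300, 600)

def payoffs(c1, c2):
    """Payoffs of part (b): Player 1 ranks outcomes by the referee's loss,
    Player 2 by her own net gain (money received minus contribution)."""
    total = c1 + c2
    referee_loss = total // 2             # the referee adds 50% of the pot
    share = (total + referee_loss) // 2   # each player receives half of the total
    return referee_loss, share - c2

def strictly_dominant(player):
    """Return the strictly dominant strategy of `player` (0 or 1), or None."""
    def u(own, opp):
        profile = (own, opp) if player == 0 else (opp, own)
        return payoffs(*profile)[player]

    for s in CONTRIBUTIONS:
        rivals = [t for t in CONTRIBUTIONS if t != s]
        if all(u(s, opp) > u(t, opp) for t in rivals for opp in CONTRIBUTIONS):
            return s
    return None

print(strictly_dominant(0), strictly_dominant(1))  # 600 0
```

Player 1's payoff is half of the total contributed, so it rises with his own contribution, while Player 2 loses a quarter of every dollar she puts in; this is why the search returns 600 and 0.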


Exercise 1.5. The game under consideration is the following:

                     Player 2 (Bob)
                     1       3       5
Player 1       2    3, 2    3, 2    2, 3
(Antonia)      4    3, 2    2, 3    1, 1
               6    2, 3    1, 1    1, 1

(a) For Player 1, 6 is strictly dominated by 2. There is no other strategy which
is strictly dominated. Player 2 does not have any strictly dominated
strategies.

(b) For Player 1, 6 is weakly dominated by 4 (and also by 2, since strict
dominance implies weak dominance); 4 is weakly dominated by 2. Player 2
does not have any weakly dominated strategies.


Exercise 1.6. (a) The game is as follows (in each cell the payoffs are listed in the
order Player 1, Player 2, Player 3):

Player 3: 0
                          Player 2
                 0             3             6
Player 1   0     0, 0, 0       2, -1, 2      4, -2, 4
           3     -1, 2, 2      1, 1, 4       3, 0, 6
           6     -2, 4, 4      0, 3, 6       2, 2, 8

Player 3: 3
                          Player 2
                 0             3             6
Player 1   0     2, 2, -1      4, 1, 1       6, 0, 3
           3     1, 4, 1       3, 3, 3       5, 2, 5
           6     0, 6, 3       2, 5, 5       4, 4, 7

Player 3: 6
                          Player 2
                 0             3             6
Player 1   0     4, 4, -2      6, 3, 0       8, 2, 2
           3     3, 6, 0       5, 5, 2       7, 4, 4
           6     2, 8, 2       4, 7, 4       6, 6, 6

(b) For Player 1, 3 strictly dominates 6, 0 strictly dominates 6, 0 strictly
dominates 3 (the same is true for every player).

(c) The strict dominant-strategy equilibrium is (0,0,0) (everybody contributes
0).


Exercise 1.7. The game is as follows:

                                      Player 2
                        $10M     $20M     $30M     $40M     $50M
                $10M    0, 40    0, 40    0, 40    0, 40    0, 40
Player 1        $20M    20, 0    0, 30    0, 30    0, 30    0, 30
(value $30M)    $30M    20, 0    10, 0    0, 20    0, 20    0, 20
                $40M    20, 0    10, 0    0, 0     0, 10    0, 10
                $50M    20, 0    10, 0    0, 0     -10, 0   0, 0

Exercise 1.8. No. Suppose, by contradiction, that \(b_1\) is a weakly dominant
strategy for Player 1. It cannot be that \(b_1 > v_1\), because when \(b_2 = b_3 = b_1\) Player
1 wins and pays \(b_1\), thereby obtaining a payoff of \(v_1 - b_1 < 0\), whereas bidding 0
would give him a payoff of 0. It cannot be that \(b_1 = v_1\) because when \(b_2 > b_1\) and
\(b_3 < v_1\) the auction is won by Player 2 and Player 1 gets a payoff of 0, while a
bid of Player 1 greater than \(b_2\) would make him the winner with a payoff of
\(v_1 - b_3 > 0\). Similarly, it cannot be that \(b_1 < v_1\) because when \(b_2 > b_1\) and \(b_3 < v_1\)
then the auction is won by Player 2 and Player 1 gets a payoff of 0, while a bid
greater than \(b_2\) would make him the winner with a payoff of \(v_1 - b_3 > 0\).
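
The three cases of this argument can be illustrated numerically. The sketch below is not part of the original solution: the value v1 = 10 and the specific bids are hypothetical numbers chosen only for illustration, and the third price is computed as the third-largest bid, which agrees with the two examples in the statement of the exercise.

```python
def third_price_outcome(bids):
    """Return (index of the winner, price paid) for three or more bids.
    Ties go to the lowest index; the price is the third-highest bid."""
    winner = max(range(len(bids)), key=lambda i: (bids[i], -i))
    price = sorted(bids, reverse=True)[2]
    return winner, price

def payoff_player1(v1, bids):
    """Payoff of Player 1 (index 0) when her value is v1."""
    winner, price = third_price_outcome(bids)
    return v1 - price if winner == 0 else 0

V1 = 10  # hypothetical value of Player 1

# Case b1 > v1: against b2 = b3 = b1, overbidding loses money while bidding 0 does not.
print(payoff_player1(V1, [12, 12, 12]), payoff_player1(V1, [0, 12, 12]))   # -2 0
# Case b1 = v1: against b2 = 15, b3 = 4, bidding above b2 is strictly better.
print(payoff_player1(V1, [10, 15, 4]), payoff_player1(V1, [20, 15, 4]))    # 0 6
# Case b1 < v1: the same opposing bids give the same comparison.
print(payoff_player1(V1, [5, 15, 4]), payoff_player1(V1, [20, 15, 4]))     # 0 6
```

In each case some other bid does strictly better against a particular pair of opposing bids, so none of the three candidate bids can be weakly dominant.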

Exercise 1.9. (a) Since \(\sum_{i=1}^{5} v_i = 85 < \sum_{i=1}^{5} c_i = 100\) the Pareto efficient decision is
not to build the park.

(b) Since \(\sum_{i=1}^{5} w_i = 120 > \sum_{i=1}^{5} c_i = 100\) the park would be built.

(c) Individuals 1 and 3 are pivotal and each of them has to pay a tax of $20. The
other individuals are not pivotal and thus are not taxed.

(d) For each individual \(i\) it is a dominant strategy to report \(v_i\) and thus, by part
(a), the decision will be the Pareto efficient one, namely not to build the park.

(e) When every individual reports truthfully, Individuals 4 and 5 are pivotal
and Individual 4 has to pay a tax of $25, while Individual 5 has to pay a tax of
$10. The others are not pivotal and do not have to pay a tax.

(f) Assume that all the other individuals report truthfully; then if Individual 1
reports truthfully, he is not pivotal, the project is not carried out and his utility
is \(\bar{m}_1\). Any other \(w_1\) that leads to the same decision (not to build the park)
gives him the same utility. If, on the other hand, he chooses a \(w_1\) that leads to a
decision to build the park, then Individual 1 will become pivotal and will have
to pay a tax of $45 with a utility of
\(\bar{m}_1 + (v_1 - c_1) - 45 = \bar{m}_1 + 30 - 45 = \bar{m}_1 - 15\), so that
he would be worse off relative to reporting truthfully.
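
The answers to parts (b) through (f) can be reproduced with a short computation. The sketch below is not part of the original solution and rests on stated assumptions: it takes c_1 = $30, v_1 = $60 and w_4 = -$30 (the values consistent with the totals 100, 85 and 120 quoted above), it assumes the park is built only when total stated benefits strictly exceed total costs, and it charges a pivotal individual the absolute net amount of the others. The name `pivotal_mechanism` is an illustrative choice.

```python
COSTS = {1: 30, 2: 25, 3: 25, 4: 15, 5: 5}
TRUE_BENEFITS = {1: 60, 2: 15, 3: 55, 4: -25, 5: -20}   # v_1 assumed to be 60
STATED = {1: 70, 2: 10, 3: 65, 4: -30, 5: 5}            # w_4 assumed to be -30

def pivotal_mechanism(reports, costs=COSTS):
    """Return (build, taxes): the decision and the tax owed by each individual."""
    net = {i: reports[i] - costs[i] for i in reports}
    total = sum(net.values())
    build = total > 0                 # assumption: a tie means the park is not built
    taxes = {}
    for i in reports:
        without_i = total - net[i]    # net stated benefit of everybody else
        pivotal = (without_i > 0) != build
        taxes[i] = abs(without_i) if pivotal else 0
    return build, taxes

print(pivotal_mechanism(STATED))         # parts (b) and (c)
print(pivotal_mechanism(TRUE_BENEFITS))  # parts (d) and (e)

# Part (f): Individual 1 overstates enough to get the park built.
build, taxes = pivotal_mechanism({**TRUE_BENEFITS, 1: 100})
print(build, taxes[1])                   # True 45
```

With the stated announcements the park is built and Individuals 1 and 3 each pay 20; with truthful reports it is not built and Individuals 4 and 5 pay 25 and 10, matching the answers above.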


Exercise 1.10. The game under consideration is the following:

                     Player 2 (Bob)
                     1       3       5
Player 1       2    3, 2    3, 2    2, 3
(Antonia)      4    3, 2    2, 3    1, 1
               6    2, 3    1, 1    1, 1

(a) The first step of the procedure eliminates 6 for Player 1. After this step the
procedure stops and thus the output is

                     Player 2 (Bob)
                     1       3       5
Player 1       2    3, 2    3, 2    2, 3
(Antonia)      4    3, 2    2, 3    1, 1

(b) The first step of the procedure eliminates 4 and 6 for Player 1 and nothing
for Player 2. The second step of the procedure eliminates 1 and 3 for Player
2. Thus the output is the strategy profile (2,5), which constitutes the iterated
weak dominant-strategy equilibrium of this game.

Exercise 1.11. The game under consideration is

                   Player 2
               d       e       f
Player 1  a   8, 6    0, 9    3, 8
          b   3, 2    2, 1    4, 3
          c   2, 8    1, 5    3, 1

In this game c is strictly dominated by b; after deleting c, d becomes strictly
dominated by f; after deleting d, a becomes strictly dominated by b; after
deleting a, e becomes strictly dominated by f; deletion of e leads to only one
strategy profile, namely (b,f). Thus (b,f) is the iterated strict dominant-strategy
equilibrium.
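
The sequence of deletions described for Exercise 1.11 can be traced with a small program. This is a minimal sketch, not part of the original solution: it assumes dominance by pure strategies only, deletes at each round every strategy that is strictly dominated in the current reduced game, and uses the illustrative names `GAME` and `idsds`.

```python
# Payoffs (Player 1, Player 2) of the game of Exercise 1.11.
GAME = {
    ("a", "d"): (8, 6), ("a", "e"): (0, 9), ("a", "f"): (3, 8),
    ("b", "d"): (3, 2), ("b", "e"): (2, 1), ("b", "f"): (4, 3),
    ("c", "d"): (2, 8), ("c", "e"): (1, 5), ("c", "f"): (3, 1),
}

def idsds(game, rows, cols):
    """Iterated deletion of strictly dominated strategies in a two-player game."""
    rows, cols = list(rows), list(cols)
    while True:
        # A row r is deleted if some other row t is strictly better against every column.
        bad_rows = [r for r in rows
                    if any(all(game[t, c][0] > game[r, c][0] for c in cols)
                           for t in rows if t != r)]
        # A column c is deleted if some other column d is strictly better against every row.
        bad_cols = [c for c in cols
                    if any(all(game[r, d][1] > game[r, c][1] for r in rows)
                           for d in cols if d != c)]
        if not bad_rows and not bad_cols:
            return rows, cols
        rows = [r for r in rows if r not in bad_rows]
        cols = [c for c in cols if c not in bad_cols]

print(idsds(GAME, "abc", "def"))  # (['b'], ['f'])
```

The loop removes c, then d, then a, then e, in the same order as the argument above, and stops at the single profile (b,f).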


Exercise 1.12. (a) The game-frame is as follows:

Carla votes for A
                    BOB
               A     B     C
ANN     A      A     A     A
        B      A     B     B
        C      A     C     C

Carla votes for B
                    BOB
               A     B     C
ANN     A      A     B     A
        B      B     B     B
        C      C     B     C

Carla votes for C
                    BOB
               A     B     C
ANN     A      A     A     C
        B      B     B     C
        C      C     C     C

(b) The game is as follows (in each cell the payoffs are listed in the order Ann,
Bob, Carla):

Carla votes for A
                      BOB
               A          B          C
ANN     A      2,0,1      2,0,1      2,0,1
        B      2,0,1      0,1,2      0,1,2
        C      2,0,1      1,2,0      1,2,0

Carla votes for B
                      BOB
               A          B          C
ANN     A      2,0,1      0,1,2      2,0,1
        B      0,1,2      0,1,2      0,1,2
        C      1,2,0      0,1,2      1,2,0

Carla votes for C
                      BOB
               A          B          C
ANN     A      2,0,1      2,0,1      1,2,0
        B      0,1,2      0,1,2      1,2,0
        C      1,2,0      1,2,0      1,2,0

(c) For Ann, both B and C are weakly dominated by A, while the other two
players do not have any dominated strategies. Thus in the first step of the
IDWDS we delete B and C for Ann. Hence the game reduces to (payoffs listed in
the order Bob, Carla):

                     CARLA
               A       B       C
BOB     A     0, 1    0, 1    0, 1
        B     0, 1    1, 2    0, 1
        C     0, 1    0, 1    2, 0

In this game, for Carla, B weakly dominates both A and C and for Bob A is
weakly dominated by B (and also by C). Thus in the second step of the
IDWDS we delete A and C for Carla and A for Bob. In the third and final
step we delete C for Bob. Thus we are left with a unique strategy profile,
namely (A,B,B), that is, Ann votes for herself and Bob and Carla vote for
Bob. This is the iterated weak dominant-strategy equilibrium.

(d) The elected candidate is Bob, who is Ann’s least favorite; thus the extra
power given to Ann (tie breaking in case of total disagreement) turns out to
be detrimental for Ann!




Exercise 1.13. The game under consideration is

[Payoff table not recoverable from the source: Player 1 chooses a, b or c; Player 2 chooses D, E or F.]

(a) The output of the IDSDS is as follows (first delete c and then F):

                Player 2
               D       E
Player 1  a   2, 3    2, 2
          b   2, 0    3, 1

Thus there is no iterated strict dominant-strategy equilibrium.

(b) The output of the IDWDS is (b,E) (in the first step delete c and F, the latter
because it is weakly dominated by D; in the second step delete a and in the
third step delete D). Thus (b,E) is the iterated weak dominant-strategy
equilibrium.


Exercise 1.14. The game under consideration is

                     Player 2 (Bob)
                     0       1       2
Player 1       2    0, 2    0, 3    0, 2
(Antonia)      4    0, 2    5, 5    4, 2
               6    4, 2    3, 7    2, 0

There is only one Nash equilibrium, namely (4,1) with payoffs (5,5).

Exercise 1.15. The game of part (b) of Exercise 1.3 is as follows:

P = Press

                          Bob
                    P              not P
Alice   P        1, 0, 0         0, 1, 0
        not P    0, 1, 0         1, 0, 0
                     Charlie: P

                          Bob
                    P              not P
Alice   P        0, 1, 0         1, 0, 0
        not P    1, 0, 0         0, 0, 1
                     Charlie: not P

This game has only one Nash equilibrium, namely (not P, P, not P).

The game of part (c) of Exercise 1.3 is as follows:

P = Press

                          Bob
                    P              not P
Alice   P        1, 0, 0         0, 2, 1
        not P    0, 2, 1         1, 0, 0
                     Charlie: P

                          Bob
                    P              not P
Alice   P        0, 2, 1         1, 0, 0
        not P    1, 0, 0         0, 1, 2
                     Charlie: not P

This game does not have any Nash equilibria.
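
Both claims of Exercise 1.15 can be confirmed by checking all eight strategy profiles. The following Python sketch is an illustration rather than part of the original solution; the names `winner`, `UTILITY_B`, `UTILITY_C` and `pure_nash` are hypothetical, and `True` stands for pressing the key.

```python
from itertools import product

PLAYERS = ("Alice", "Bob", "Charlie")

def winner(alice, bob, charlie):
    """Who wins, given which players press the key (True = press)."""
    pressed = alice + bob + charlie
    if pressed % 2 == 1:
        return "Alice"
    return "Bob" if pressed == 2 else "Charlie"

# Utility of each player as a function of who wins.
UTILITY_B = {p: {w: int(p == w) for w in PLAYERS} for p in PLAYERS}   # part (b)
UTILITY_C = {                                                         # part (c)
    "Alice":   {"Alice": 1, "Bob": 0, "Charlie": 0},
    "Bob":     {"Alice": 0, "Bob": 2, "Charlie": 1},
    "Charlie": {"Alice": 0, "Bob": 1, "Charlie": 2},
}

def pure_nash(utility):
    """Profiles (Alice, Bob, Charlie) from which no player gains by switching."""
    equilibria = []
    for profile in product((True, False), repeat=3):
        stable = True
        for i, player in enumerate(PLAYERS):
            deviation = list(profile)
            deviation[i] = not profile[i]      # the only alternative action
            if utility[player][winner(*deviation)] > utility[player][winner(*profile)]:
                stable = False
        if stable:
            equilibria.append(profile)
    return equilibria

print(pure_nash(UTILITY_B))  # [(False, True, False)], i.e. (not P, P, not P)
print(pure_nash(UTILITY_C))  # []
```

In part (c) the profile (not P, P, not P) stops being an equilibrium because Bob now strictly prefers a win by Charlie to a win by Alice, so he gains by not pressing.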

Exercise 1.16. The game under consideration is

                         Player 2
                  0           300          600
Player 1   0      0, 0        150, -75     300, -150
           300    150, 225    300, 150     450, 75
           600    300, 450    450, 375     600, 300

This game has only one Nash equilibrium, namely (600,0).

Exercise 1.17. The game under consideration is:

Player 3: 0
                          Player 2
                 0             3             6
Player 1   0     0, 0, 0       2, -1, 2      4, -2, 4
           3     -1, 2, 2      1, 1, 4       3, 0, 6
           6     -2, 4, 4      0, 3, 6       2, 2, 8

Player 3: 3
                          Player 2
                 0             3             6
Player 1   0     2, 2, -1      4, 1, 1       6, 0, 3
           3     1, 4, 1       3, 3, 3       5, 2, 5
           6     0, 6, 3       2, 5, 5       4, 4, 7

Player 3: 6
                          Player 2
                 0             3             6
Player 1   0     4, 4, -2      6, 3, 0       8, 2, 2
           3     3, 6, 0       5, 5, 2       7, 4, 4
           6     2, 8, 2       4, 7, 4       6, 6, 6

This game has only one Nash equilibrium, namely (0,0,0).


Exercise 1.18. The game under consideration is as follows:

                                      Player 2
                        $10M     $20M     $30M     $40M     $50M
                $10M    0, 40    0, 40    0, 40    0, 40    0, 40
Player 1        $20M    20, 0    0, 30    0, 30    0, 30    0, 30
(value $30M)    $30M    20, 0    10, 0    0, 20    0, 20    0, 20
                $40M    20, 0    10, 0    0, 0     0, 10    0, 10
                $50M    20, 0    10, 0    0, 0     -10, 0   0, 0

This game has 15 Nash equilibria: (10,30), (10,40), (10,50), (20,30), (20,40),
(20,50), (30,30), (30,40), (30,50), (40,40), (40,50), (50,10), (50,20), (50,30), (50,50).
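
The list of 15 equilibria can be reproduced by brute force. This is a minimal sketch, not part of the original solution: the payoff rule is rebuilt from the values $30M and $50M, and the tie-breaking rule (ties go to Player 2) is an assumption read off the diagonal of the table, for example the entry (0,40) when both players bid $10M.

```python
BIDS = (10, 20, 30, 40, 50)   # admissible bids, in $M
VALUES = (30, 50)             # values of Player 1 and Player 2, in $M

def payoffs(b1, b2):
    """Second-price auction; ties are assumed to be resolved in favour of Player 2."""
    if b1 > b2:
        return VALUES[0] - b2, 0   # Player 1 wins and pays Player 2's bid
    return 0, VALUES[1] - b1       # Player 2 wins and pays Player 1's bid

def pure_nash():
    """All bid pairs from which neither player has a profitable unilateral deviation."""
    result = []
    for b1 in BIDS:
        for b2 in BIDS:
            u1, u2 = payoffs(b1, b2)
            best_for_1 = all(payoffs(d, b2)[0] <= u1 for d in BIDS)
            best_for_2 = all(payoffs(b1, d)[1] <= u2 for d in BIDS)
            if best_for_1 and best_for_2:
                result.append((b1, b2))
    return result

equilibria = pure_nash()
print(len(equilibria))  # 15
print(equilibria)
```

The enumeration visits the profiles row by row, so it returns the equilibria in the same order in which they are listed above.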


Exercise 1.19. A Nash equilibrium is \(b_1 = b_2 = b_3 = v_1\) (with payoffs (0,0,0)).
Convince yourself that this is indeed a Nash equilibrium. There are many more
Nash equilibria: for example, any triple \((b_1, b_2, b_3)\) with \(b_2 = b_3 = v_1\) and \(b_1 > v_1\)
is a Nash equilibrium (with payoffs (0,0,0)) and so is any triple \((b_1, b_2, b_3)\) with
\(b_2 = b_3 = v_2\) and \(b_1 \ge v_2\) (with payoffs \((v_1 - v_2, 0, 0)\)).

Exercise 1.20. The game under consideration is

Carla votes for A
                      BOB
               A          B          C
ANN     A      2,0,1      2,0,1      2,0,1
        B      2,0,1      0,1,2      0,1,2
        C      2,0,1      1,2,0      1,2,0

Carla votes for B
                      BOB
               A          B          C
ANN     A      2,0,1      0,1,2      2,0,1
        B      0,1,2      0,1,2      0,1,2
        C      1,2,0      0,1,2      1,2,0

Carla votes for C
                      BOB
               A          B          C
ANN     A      2,0,1      2,0,1      1,2,0
        B      0,1,2      0,1,2      1,2,0
        C      1,2,0      1,2,0      1,2,0

There are 5 Nash equilibria: (A,A,A), (B,B,B), (C,C,C), (A,C,A) and (A,B,B).
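
The five equilibria can be found by checking all 27 voting profiles. The sketch below is an illustration, not part of the original solution; it encodes the voting rule of Exercise 1.12 and the utilities of part (b), with votes written in the order (Ann, Bob, Carla) and with the illustrative names `elected` and `pure_nash`.

```python
from itertools import product

CANDIDATES = "ABC"   # A = Ann, B = Bob, C = Carla

# Utility of each voter (Ann, Bob, Carla) for each possible president.
UTILITY = (
    {"A": 2, "C": 1, "B": 0},   # Ann
    {"C": 2, "B": 1, "A": 0},   # Bob
    {"B": 2, "A": 1, "C": 0},   # Carla
)

def elected(votes):
    """Majority wins; with complete disagreement Ann's vote (votes[0]) decides."""
    for candidate in CANDIDATES:
        if votes.count(candidate) >= 2:
            return candidate
    return votes[0]

def pure_nash():
    """Voting profiles from which no single voter gains by changing her vote."""
    equilibria = []
    for votes in product(CANDIDATES, repeat=3):
        current = elected(votes)
        if all(UTILITY[i][elected(votes[:i] + (d,) + votes[i + 1:])] <= UTILITY[i][current]
               for i in range(3) for d in CANDIDATES):
            equilibria.append("".join(votes))
    return equilibria

print(pure_nash())  # ['AAA', 'ABB', 'ACA', 'BBB', 'CCC']
```

Several of these equilibria survive only because a single voter cannot change the outcome, which is why the unanimous profiles (B,B,B) and (C,C,C) appear even though they elect candidates that some voters rank last.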

Exercise 1.21. The game under consideration is as follows:

[Payoff table not recoverable from the source: Player 1 chooses a, b or c; Player 2 chooses D, E or F.]

There are 2 Nash equilibria: (a,D) and (b,E).

Exercise  1.22.  (a)  For  every  ee[0,l],  (e,e, ...,e)  is  a Nash  equilibrium.  The 
payoff  of  Player  i is  = 2e ; if  player  i increases  her  effort  to  a > e (of 

course,  this  can  only  happen  if  e < 1),  then  her  payoff  decreases  to  4e  - 2 a and 
if  she  decreases  her  effort  to  a < e (of  course,  this  can  only  happen  if  e > 0), 
then  her  payoff  decreases  to  la. 

There  is  no  Nash  equilibrium  where  two  players  choose  different  levels  of 
effort.  Proof:  suppose  there  is  an  equilibrium  (al,a2,...,an)  where  at  * a . for 

two  players  i and  j.  Let  amin  =min{a1,...,an}  and  let  k be  a player  such  that 
ak  > amm  (such  a player  exists  by  our  supposition).  Then  the  payoff  to  player  k 
is  nk  = 4 amin  - 2 ak  and  if  she  reduced  her  effort  to  amin  her  payoff  would 
increase  to  2 amin . 

(b)  Any  symmetric  equilibrium  with  e < 1 is  Pareto  inefficient,  because  all  the 
players  would  be  better  off  if  they  simultaneously  switched  to  (1,1,.. .,1).  On 
the  other  hand,  the  symmetric  equilibrium  (1,1,. ..,1)  is  Pareto  efficient. 

(c)  The  symmetric  equilibrium  . 



Exercise 1.23. (a) The strategy sets are $S_1 = S_2 = [0,\infty)$. The payoff functions
are as follows:

$$\pi_1(x,y) = \begin{cases} \Pi - y & \text{if } x \ge y \\ 0 & \text{if } y > x \end{cases}
\qquad\text{and}\qquad
\pi_2(x,y) = \begin{cases} A - L + y & \text{if } x \ge y \\ A & \text{if } y > x \end{cases}$$


(b) Yes, for Player 1 choosing $x = \Pi$ is a weakly dominant strategy. Proof.
Consider an arbitrary $y$. We must show that $x = \Pi$ gives at least as high a
payoff against $y$ as any other $x$. Three cases are possible. Case 1: $y < \Pi$. In
this case $x = \Pi$ or any other $x$ such that $x \ge y$ yields $\pi_1 = \Pi - y > 0$, while
any $x < y$ yields $\pi_1 = 0$. Case 2: $y = \Pi$. In this case Player 1's payoff is zero no
matter what $x$ he chooses. Case 3: $y > \Pi$. In this case $x = \Pi$ or any other $x$
such that $x < y$ yields $\pi_1 = 0$, while any $x \ge y$ yields $\pi_1 = \Pi - y < 0$.


(c) No, choosing $y = L$ is not a dominant strategy for Player 2. For example, if
$x > L$ then choosing $y = L$ yields $\pi_2 = A$ while choosing a $y$ such that
$L < y \le x$ yields $\pi_2 = A - L + y > A$.


(d) Suppose that $\Pi > L$. If $(x,y)$ is a Nash equilibrium with $x \ge y$ then it must
be that $y \le \Pi$ (otherwise Player 1 could increase its payoff by reducing $x$
below $y$) and $y \ge L$ (otherwise Player 2 would be better off by increasing $y$
above $x$). Thus it must be $L \le y \le \Pi$, which is possible, given our
assumption. However, it cannot be that $x > y$, because Player 2 would be
getting a higher payoff by increasing $y$ to $x$. Thus it must be $x \le y$, which
(together with our hypothesis that $x \ge y$) implies that $x = y$. Thus the
following are Nash equilibria:

all the pairs $(x,y)$ with $L \le y \le \Pi$ and $x = y$.

Now consider pairs $(x,y)$ with $x < y$. Then it cannot be that $y < \Pi$,
because Player 1 could increase its payoff by increasing $x$ to $y$. Thus it must
be $y \ge \Pi$ (hence, by our supposition that $\Pi > L$, $y > L$). Furthermore, it
must be that $x \le L$ (otherwise Player 2 could increase its profits by reducing
$y$ to (or below) $x$). Thus

$(x,y)$ with $x < y$ is a Nash equilibrium if and only if $x \le L$ and $y \ge \Pi$.
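
The characterization in part (d) can be checked by brute force on a discretized version of the game. The sketch below uses the payoff functions of part (a) with hypothetical parameter values (profit 3, loss 1, initial wealth 10, so that the profit exceeds the loss) and restricts both players to integer claims between 0 and 5; the continuous game is only approximated, and all names are illustrative.

```python
PI, L, A = 3, 1, 10          # hypothetical values with PI > L
GRID = range(0, 6)           # integer claims 0, 1, ..., 5


def payoff1(x, y):
    return PI - y if x >= y else 0


def payoff2(x, y):
    return A - L + y if x >= y else A


def is_nash(x, y):
    """True if neither player has a profitable deviation on the grid."""
    best1 = max(payoff1(x2, y) for x2 in GRID)
    best2 = max(payoff2(x, y2) for y2 in GRID)
    return payoff1(x, y) == best1 and payoff2(x, y) == best2


nash = {(x, y) for x in GRID for y in GRID if is_nash(x, y)}
# Equilibria predicted by part (d): x = y in [L, PI], or x < y with x <= L and y >= PI.
predicted = {(x, y) for x in GRID for y in GRID
             if (x == y and L <= y <= PI) or (x < y and x <= L and y >= PI)}
print(sorted(nash))
print(nash == predicted)
```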


(e) Suppose that $\Pi < L$. For the same reasons given above, an equilibrium
with $x \ge y$ requires $L \le y \le \Pi$. However, this is not possible given that
$\Pi < L$. Hence,

there is no Nash equilibrium $(x,y)$ with $x \ge y$.

Thus we must restrict attention to pairs $(x,y)$ with $x < y$. As explained
above, it must be that $y \ge \Pi$ and $x \le L$. Thus,

$(x,y)$ with $x < y$ is a Nash equilibrium if and only if $\Pi \le y$ and $x \le L$.

(f) Pareto efficiency requires that the chemical plant be shut down if $\Pi < L$
and that it remain operational if $\Pi > L$. Now, when $\Pi < L$ all the
equilibria have $x < y$, which leads to shut-down, hence a Pareto efficient
outcome. When $\Pi > L$, there are two types of equilibria: one where $x = y$
and the plant remains operational (a Pareto efficient outcome) and the other
where $x < y$, in which case the plant shuts down, yielding a Pareto
inefficient outcome.


Chapter 2

Dynamic games with perfect information

2.1  Trees,  frames  and  games 

Often  interactions  are  not  simultaneous  but  sequential.  For  example,  in  the 
game  of  Chess  the  two  players,  White  and  Black,  take  turns  moving 
pieces  on  the  board,  having  full  knowledge  of  the  opponent’s  (and  their 
own)  past  moves.  Games  with  sequential  interaction  are  called  dynamic  games  or 
games  in  extensive  form.  This  chapter  is  devoted  to  the  subclass  of  dynamic  games 
characterized  by  perfect  information,  namely  the  property  that,  whenever  it  is  her 
turn  to  move,  a player  knows  all  the  preceding  moves.  Perfect-information  games 
are  represented  by  means  of  rooted  directed  trees. 

A rooted  directed  tree  consists  of  a set  of  nodes  and  directed  edges  joining  them.  The 
root  of  the  tree  has  no  directed  edges  leading  to  it  (has  indegree  0),  while  every 
other  node  has  exactly  one  directed  edge  leading  to  it  (has  indegree  1).  There  is  a 
unique  path  (that  is,  a unique  sequence  of  directed  edges)  leading  from  the  root  to 
any  other  node.  A node  that  has  no  directed  edges  out  of  it  (has  outdegree  0)  is 
called  a terminal  node,  while  every  other  node  is  called  a decision  node.  We  shall 
denote the set of nodes by X, the set of decision nodes by D and the set of terminal
nodes by Z. Thus $X = D \cup Z$.

Definition  2.1.  A finite  extensive  form  (or  frame)  with  perfect  information  consists  of 
the  following  items. 

• A finite  rooted  directed  tree. 

• A set of players $I = \{1,\dots,n\}$ and a function that assigns one player to every
decision node.

• A set  of  actions  A and  a function  that  assigns  one  action  to  every  directed  edge, 
satisfying  the  restriction  that  no  two  edges  out  of  the  same  node  are  assigned  the 
same  action. 

• A set  of  outcomes  O and  a function  that  assigns  an  outcome  to  every  terminal 
node. 

Example 2.1. Amy (Player 1) and Beth (Player 2) have decided to dissolve a
business partnership whose assets have been valued at $100,000. The charter of the
partnership prescribes that the senior partner, Amy, make an offer concerning the
division of the assets to the junior partner, Beth. The junior partner can Accept, in
which case the proposed division is implemented, or Reject, in which case the dispute
goes to litigation. Litigating involves a cost of $20,000 in legal fees for each partner
and  the  typical  verdict  assigns  60%  of  the  assets  to  the  senior  partner  and  the 
remaining  40%  to  the  junior  partner.  Suppose,  for  simplicity,  that  there  is  no 
uncertainty  about  the  verdict  (how  to  model  uncertainty  will  be  discussed  in  a later 
chapter).  Suppose  also  that  there  are  only  two  possible  offers  that  Amy  can  make:  a 
50-50  split  or  a 70-30  split.  This  situation  can  be  represented  as  a finite  extensive 
form  with  perfect  information  as  shown  in  Figure  2.1.  Each  outcome  is  represented 
as  two  sums  of  money:  the  top  one  is  what  Player  1 gets  and  the  bottom  one  what 
Player  2 gets. 


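[Figure 2.1: Player 1 chooses between offering a 50-50 split and a 70-30 split; Player 2
then chooses between Accept and Reject. The four outcomes (Player 1's amount, Player 2's
amount) are: o1 = ($50,000, $50,000) if the 50-50 offer is accepted, o2 = ($40,000, $20,000)
if it is rejected, o3 = ($70,000, $30,000) if the 70-30 offer is accepted and
o4 = ($40,000, $20,000) if it is rejected.]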


Figure  2.1 

A perfect-information  extensive  form  representing 
the  situation  described  in  Example  2.1 
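
The four items of Definition 2.1 map naturally onto a small data structure. The following is a minimal sketch of the frame of Example 2.1, under the assumption that the outcomes are labelled o1 to o4 from left to right; the class name, the action labels and the helper function are illustrative choices, not part of the definition. Storing the successors of a node in a dictionary keyed by action enforces the requirement that no two edges out of the same node carry the same action.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class Node:
    """A node of a rooted directed tree; edges are labelled by actions."""
    player: Optional[int] = None            # who moves here (decision nodes only)
    outcome: Optional[str] = None           # outcome label (terminal nodes only)
    children: Dict[str, "Node"] = field(default_factory=dict)  # action -> successor

    def is_terminal(self) -> bool:
        return not self.children


def terminal_outcomes(node):
    """List the outcomes at the terminal nodes below `node`, left to right."""
    if node.is_terminal():
        return [node.outcome]
    return [o for child in node.children.values() for o in terminal_outcomes(child)]


# Sums of money in Example 2.1, as (Player 1's dollars, Player 2's dollars).
ASSETS, FEE = 100_000, 20_000
litigation = (ASSETS * 60 // 100 - FEE, ASSETS * 40 // 100 - FEE)
OUTCOMES = {"o1": (ASSETS // 2, ASSETS // 2), "o2": litigation,
            "o3": (ASSETS * 70 // 100, ASSETS * 30 // 100), "o4": litigation}

frame = Node(player=1, children={
    "offer 50-50": Node(player=2, children={"Accept": Node(outcome="o1"),
                                            "Reject": Node(outcome="o2")}),
    "offer 70-30": Node(player=2, children={"Accept": Node(outcome="o3"),
                                            "Reject": Node(outcome="o4")}),
})

print(terminal_outcomes(frame))
print(OUTCOMES["o2"])
```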

What  should  we  expect  the  players  to  do  in  the  above  game?  Consider  the 
following  reasoning,  which  is  called  backward  induction  reasoning,  because  it  starts 
from  the  end  of  the  game  and  proceeds  backwards  towards  the  root: 

If Player 2 (the junior partner) is offered a 50-50 split then, if
she  accepts,  she  will  get  $50,000,  while,  if  she  rejects,  she  will 
get  $20,000  (the  court-assigned  40%  minus  legal  fees  in  the 
amount  of  $20,000);  thus,  if  rational,  she  will  accept.  Similarly, 
if  Player  2 is  offered  a 70-30  split  then,  if  she  accepts,  she  will 
get  $30,000,  while,  if  she  rejects,  she  will  get  $20,000  (the 
court-assigned  40%  minus  legal  fees  in  the  amount  of  $20,000); 
thus,  if  rational,  she  will  accept.  Anticipating  all  of  this,  Player 
1 realizes  that,  if  she  offers  a 50-50  split  then  she  will  end  up 
with  $50,000,  while  if  she  offers  a 70-30  split  then  she  will  end 
up  with  $70,000;  thus,  if  Player  1 is  rational  and  believes  that 
Player  2 is  rational,  she  will  offer  a 70-30  split  and  Player  2, 
being  rational,  will  accept. 

The above reasoning suffers from the same flaws as the reasoning described in
Chapter 1 (Section 1.1): it is not a valid argument because it is based on an
implicit assumption about how Player 2 ranks the outcomes, which may or
may not be correct. For example, Player 2 may feel that she worked as hard as
her senior partner and the only fair division is a 50-50 split; indeed she may feel
so strongly about this that - if offered an unfair 70-30 split - she would be
willing to sacrifice $10,000 in order to “teach a lesson to Player 1”; in other
words, she ranks outcome o4 above outcome o3.

Using  the  terminology  introduced  in  Chapter  1,  we  say  that  the  situation 
represented  in  Figure  2.1  is  not  a game  but  a game-frame.  In  order  to  convert 
that  frame  into  a game  we  need  to  add  a ranking  of  the  outcomes  for  each 
player. 

Definition 2.2. A finite extensive game with perfect information is a finite
extensive form with perfect information together with a ranking $\succsim_i$ of the set
of outcomes O, for every player $i \in I$.

As usual, it is convenient to represent the ranking of Player i by means of an
ordinal utility function $U_i : O \to \mathbb{R}$. For example, take the extensive form of
Figure 2.1 and assume that Player 1 is selfish and greedy, that is, her ranking is:

best    o3
        o1        (or, in the alternative notation, $o_3 \succ_1 o_1 \succ_1 o_2 \sim_1 o_4$),
worst   o2, o4

while Player 2 is concerned with fairness and her ranking is:

best    o1
        o2, o4    (or, in the alternative notation, $o_1 \succ_2 o_2 \sim_2 o_4 \succ_2 o_3$).
worst   o3

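Then we can represent the players’ preferences using the following utility
functions:

outcome →             o1    o2    o3    o4
utility function ↓
U1 (Player 1)          2     1     3     1
U2 (Player 2)          3     2     1     2

and replace each outcome in Figure 2.1 with a pair of utilities or payoffs, as
shown in Figure 2.2, thereby obtaining one of the many possible games based
on the frame of Figure 2.1.

[Figure 2.2: the tree of Figure 2.1 with the outcomes o1, o2, o3 and o4 replaced by the
payoff pairs (2,3), (1,2), (3,1) and (1,2), respectively; in each pair the first number
is Player 1's payoff.]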

Figure  2.2 

A perfect-information  game  based  on  the  extensive  form  of  Figure  2.1. 

Now that we do have a game, rather than just a game-frame, we can indeed
apply the backward-induction reasoning and conclude that Player 1 will offer a
50-50 split, anticipating that Player 2 would reject the offer of a 70-30 split, and
Player 2 will accept Player 1’s 50-50 offer. The choices selected by the
backward-induction reasoning have been highlighted in Figure 2.2 by doubling
the corresponding edges.

This is a good time to test your understanding of the concepts introduced in
this  section,  by  going  through  the  exercises  in  Section  2.E.1  of  Appendix  2.E  at  the 
end  of  this  chapter. 


2.2  Backward  induction 

The  backward-induction  reasoning  mentioned  above  can  be  formalized  as  an 
algorithm  for  solving  any  perfect-information  game,  as  follows.  We  say  that  a 
node  is  marked  if  a payoff  or  utility  vector  is  associated  with  it.  Initially  all 
and  only  the  terminal  nodes  are  marked;  the  following  procedure  provides  a 
way  of  marking  all  the  nodes. 

Definition  2.3.  The  backward-induction  algorithm  is  the  following  procedure 
for  solving  a finite  perfect-information  game. 

1.  Select  a decision  node  x whose  immediate  successors  are  all  marked.  Let  i be 
the  player  who  moves  at  x.  Select  a choice  that  leads  to  an  immediate 
successor  of  x with  the  highest  payoff  or  utility  for  Player  i (highest  among 
the  utilities  associated  with  the  immediate  successors  of  x).  Mark  x with  the 
payoff  vector  associated  with  the  node  following  the  selected  choice. 

2.  Repeat  the  above  step  until  all  the  nodes  have  been  marked. 


Note  that,  since  the  game  is  finite,  the  above  procedure  is  well  defined.  In  the 
initial  steps  one  starts  at  those  decision  nodes  that  are  followed  only  by 
terminal  nodes,  call  them  penultimate  nodes.  After  all  the  penultimate  nodes 
have  been  marked,  there  will  be  unmarked  nodes  whose  immediate  successors 
are  all  marked  and  thus  the  step  can  be  repeated. 
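
The marking procedure of Definition 2.3 can be sketched as a short recursive function. The example below applies it to the game of Figure 2.2, using the payoff pairs (2,3), (1,2), (3,1) and (1,2) for the outcomes o1, o2, o3 and o4; the dictionary-based tree encoding, the node names and the tie-breaking rule (first maximizing choice) are assumptions made for illustration. The recursion marks a decision node only after all of its immediate successors have been marked, which is the order that the definition requires.

```python
def backward_induction(node, selected):
    """Mark `node` with a payoff vector and record in `selected` the choice made
    at each decision node (ties go to the first maximizing choice)."""
    if isinstance(node, tuple):          # terminal node: already marked
        return node
    i = node["player"] - 1               # position of the mover's payoff
    best_action, best_mark = None, None
    for action, successor in node["moves"].items():
        mark = backward_induction(successor, selected)
        if best_mark is None or mark[i] > best_mark[i]:
            best_action, best_mark = action, mark
    selected[node["name"]] = best_action
    return best_mark


# The game of Figure 2.2: payoffs are (Player 1, Player 2).
game = {"name": "root", "player": 1, "moves": {
    "offer 50-50": {"name": "after 50-50", "player": 2,
                    "moves": {"Accept": (2, 3), "Reject": (1, 2)}},
    "offer 70-30": {"name": "after 70-30", "player": 2,
                    "moves": {"Accept": (3, 1), "Reject": (1, 2)}},
}}

selected = {}
root_mark = backward_induction(game, selected)
print(selected)
print(root_mark)
```

The selected choices agree with the reasoning of Section 2.1: Player 2 accepts a 50-50 offer and rejects a 70-30 offer, so Player 1 offers 50-50.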

Note  also  that,  in  general,  at  a decision  node  there  may  be  several  choices  that 
maximize  the  payoff  of  the  player  who  moves  at  that  node.  If  that  is  the  case 
then  the  procedure  requires  that  one  such  choice  be  selected.  This  arbitrary 
choice  may  lead  to  the  existence  of  several  backward-induction  solutions.  For 
example,  consider  the  game  of  Figure  2.3. 

Figure  2.3 

A perfect-information  game  with  multiple  backward-induction  solutions. 


In  this  game,  starting  at  node  x of  Player  2 we  select  choice  c (since  it  gives 
Player  2 a higher  payoff  than  d).  Then  we  move  on  to  Player  3’s  node  and  we 
find  that  both  choices  there  are  payoff  maximizing  for  Player  3;  thus  there  are 
two  ways  to  proceed.  In  Figure  2.4  we  show  the  steps  of  the  backward- 
induction  algorithm  with  the  selection  of  choice  g,  while  Figure  2.5  shows  the 
steps  of  the  algorithm  with  the  selection  of  choice  h.  As  before,  the  selected 
choices  are  shown  by  double  edges.  In  Figures  2.4  and  2.5  the  marking  of  nodes 
is  shown  explicitly,  but  later  on  we  will  represent  the  backward-induction 
solution  more  succinctly  by  merely  highlighting  the  selected  choices. 

Figure 2.4

One possible output of the backward-induction
algorithm applied to the game of Figure 2.3.

Figure 2.5

Another possible output of the backward-induction
algorithm applied to the game of Figure 2.3.
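
When ties occur, every way of breaking them gives a possible output of the algorithm. The sketch below enumerates all of them for a small hypothetical two-player game (it is not the game of Figure 2.3, whose payoffs are not reproduced here): Player 1 chooses between L, which ends the game with payoffs (1,1), and R, after which Player 2 is indifferent between l, giving (0,2), and r, giving (2,2). The tree encoding and all names are illustrative.

```python
def all_solutions(node):
    """Return every (payoff vector, choices) pair that the backward-induction
    algorithm can produce at `node`, one for each way of breaking ties."""
    if isinstance(node, tuple):                     # terminal node
        return [(node, {})]
    i = node["player"] - 1
    # Combine, action by action, every way of solving the subtrees.
    partial = [({}, {})]                            # (marks of successors, choices below)
    for action, successor in node["moves"].items():
        partial = [({**marks, action: mark}, {**below, **choices})
                   for marks, below in partial
                   for mark, choices in all_solutions(successor)]
    solutions = []
    for marks, below in partial:
        top = max(mark[i] for mark in marks.values())
        for action, mark in marks.items():
            if mark[i] == top:                      # every maximizing choice is allowed
                solutions.append((mark, {**below, node["name"]: action}))
    return solutions


# Hypothetical game with a tie at Player 2's node.
game = {"name": "root", "player": 1, "moves": {
    "L": (1, 1),
    "R": {"name": "x", "player": 2, "moves": {"l": (0, 2), "r": (2, 2)}},
}}

solutions = all_solutions(game)
for payoffs, choices in solutions:
    print(choices, payoffs)
```

In this hypothetical game the two ways of resolving Player 2's indifference lead to different choices by Player 1 at the root, so the two outputs differ along the whole tree.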

How  should  one  define  the  output  of  the  backward-induction  algorithm  and 
the  notion  of  backward-induction  solution?  What  kind  of  objects  are  they? 
Before  we  answer  this  question  we  need  to  introduce  the  notion  of  strategy  in  a 
perfect-information  game. 

This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this  section,  by  going  through  the  exercises  in  Section  2.E.2  of  Appendix  2.E  at  the 
end  of  this  chapter. 


2.3 Strategies in perfect-information games

A strategy  for  a player  in  a perfect-information  game  is  a complete,  contingent 
plan  on  how  to  play  the  game.  Consider,  for  example,  the  following  game  - 
which  reproduces  Figure  2.3  - and  let  us  focus  on  Player  2. 


[The game of Figure 2.3, reproduced.]

Before  the  game  is  played,  Player  2 does  not  know  what  Player  1 will  do  and 
thus  a complete  plan  needs  to  specify  what  she  will  do  if  Player  1 decides  to 
play  a and  what  she  will  do  if  Player  1 decides  to  play  b.  A possible  plan,  or 
strategy, is “if Player 1 chooses a then I will choose c and if Player 1 chooses b then I
will choose e”, which we can denote more succinctly as (c,e). The other possible
plans, or strategies, for Player 2 are (c,f), (d,e) and (d,f). The formal definition of
strategy is as follows.


Definition  2.4.  A strategy  for  a player  in  a perfect-information  game  is  a list  of 
choices,  one  for  each  decision  node  of  that  player. 


For example, suppose that Player 1 has three decision nodes in a given game: at one
node she has three possible choices, $a_1$, $a_2$ and $a_3$, at another node she has two
possible choices, $b_1$ and $b_2$, and at the third node she has four possible choices,
$c_1$, $c_2$, $c_3$ and $c_4$. Then a strategy for Player 1 in that game can be thought of as a
way of filling in three blanks:

$$\Big(\ \underbrace{\underline{\qquad}}_{\text{one of } a_1, a_2, a_3},\ \underbrace{\underline{\qquad}}_{\text{one of } b_1, b_2},\ \underbrace{\underline{\qquad}}_{\text{one of } c_1, c_2, c_3, c_4}\ \Big)$$

Since there are 3 choices for the first blank, 2 for the second and 4 for the third, the
total number of possible strategies for Player 1 in this case would be $3 \times 2 \times 4 = 24$.
One strategy is $(a_2, b_1, c_1)$, another strategy is $(a_1, b_2, c_4)$, etc.
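
Since a strategy picks one choice at each decision node, the set of strategies is the Cartesian product of the choice sets. A minimal sketch of the count above, with illustrative node labels:

```python
from itertools import product

# Choices available to Player 1 at each of her three decision nodes.
choices_at_node = {
    "first node": ["a1", "a2", "a3"],
    "second node": ["b1", "b2"],
    "third node": ["c1", "c2", "c3", "c4"],
}

# A strategy is one choice per node, so the strategy set is a Cartesian product.
strategies = list(product(*choices_at_node.values()))
print(len(strategies))
print(strategies[:3])
```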

It should be noted that the notion of strategy involves redundancies. To see this,
consider  the  game  of  Figure  2.6  below.  In  this  game  a possible  strategy  for  Player  1 
is  (a,g),  which  means  that  Player  1 is  planning  to  choose  a at  the  root  of  the  tree  and 
would  choose  g at  her  other  node.  But  if  Player  1 indeed  chooses  a,  then  her  other 
node  will  not  be  reached  and  thus  why  should  Player  1 make  a plan  on  what  to  do 
there?  One  could  justify  this  redundancy  in  the  notion  of  strategy  in  a number  of 
ways:  (1)  Player  1 is  so  cautious  that  she  wants  her  plan  to  cover  also  the  possibility 
that  she  might  make  mistakes  in  the  implementation  of  parts  of  her  plan  (in  this 
case,  she  allows  for  the  possibility  that  - despite  her  intention  to  play  a - she  might 
end  up  playing  b)  or  (2)  we  can  think  of  a strategy  as  a set  of  instructions  given  to  a 
third party on how to play the game on Player 1’s behalf, in which case Player 1
might  indeed  be  worried  about  the  possibility  of  mistakes  in  the  implementation 
and  thus  want  to  cover  all  contingencies,  etc.  An  alternative  justification  relies  on  a 
different  interpretation  of  the  notion  of  strategy:  not  as  a plan  of  Player  1 but  as  a 
belief  in  the  mind  of  Player  2 concerning  what  Player  1 would  do.  For  the  moment 
we  will  set  this  issue  aside  and  simply  use  the  notion  of  strategy  as  given  in 
Definition  2.4. 



Figure  2.6 

A perfect-information  game. 

Using  Definition  2.4,  one  can  associate  with  every  perfect-information  game  a 
strategic-form  (or  normal-form)  game:  a strategy  profile  determines  a unique 
terminal  node  that  is  reached  if  the  players  act  according  to  that  strategy  profile  and 
thus  a unique  vector  of  payoffs.  Figure  2.7  shows  the  strategic-form  associated  with 
the  perfect-information  game  of  Figure  2.6,  with  the  Nash  equilibria  highlighted. 

[Figure 2.7 table: Player 1 chooses the row and Player 2 the column (ce, cf, de or df).
The rows that survive in the source are:

             ce      cf      de      df
   bg       3,1     1,2     3,1     1,2
   bh       3,1     1,0     3,1     1,0 ]


Figure  2.7 

The  strategic  form  of  the  perfect-information  game  of  Figure  2.6 
with  the  Nash  equilibria  highlighted. 


Because  of  the  redundancy  discussed  above,  the  strategic  form  also  displays 
redundancies:  in  this  case  the  top  two  rows  are  identical. 
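
The construction of the strategic form can be sketched in a few lines. The tree used below is an assumed reconstruction of the game of Figure 2.6, chosen to be consistent with the payoffs reported in Figure 2.7 and in Remark 2.1: Player 1 moves at the root (a or b) and again after b and f (g or h), while Player 2 moves after a (c or d) and after b (e or f). The payoff (0,0) after the play ad is an assumption, and the function names are illustrative.

```python
from itertools import product


def play(s1, s2):
    """Payoffs (Player 1, Player 2) reached when Player 1 uses s1 = (root choice,
    choice at her second node) and Player 2 uses s2 = (after a, after b)."""
    root, last = s1
    after_a, after_b = s2
    if root == "a":
        return (2, 1) if after_a == "c" else (0, 0)   # (0, 0) is an assumption
    if after_b == "e":
        return (3, 1)
    return (1, 2) if last == "g" else (1, 0)


S1 = list(product("ab", "gh"))      # (a,g), (a,h), (b,g), (b,h)
S2 = list(product("cd", "ef"))      # (c,e), (c,f), (d,e), (d,f)
table = {(s1, s2): play(s1, s2) for s1 in S1 for s2 in S2}


def is_nash(s1, s2):
    """True if neither player can gain by a unilateral change of strategy."""
    u1, u2 = table[(s1, s2)]
    return (all(table[(t1, s2)][0] <= u1 for t1 in S1) and
            all(table[(s1, t2)][1] <= u2 for t2 in S2))


nash = [(s1, s2) for s1 in S1 for s2 in S2 if is_nash(s1, s2)]
for profile in nash:
    print(profile, table[profile])
```

Under these assumptions the rows for (a,g) and (a,h) coincide, because Player 1's second node is never reached after a, and the search returns five Nash equilibria.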


Armed  with  the  notion  of  strategy,  we  can  now  revisit  the  notion  of  backward- 
induction  solution.  Figure  2.8  shows  the  two  backward-induction  solutions  of  the 
game  of  Figure  2.6. 


Figure  2.8 

The  backward-induction  solutions  of  the  game  of  Figure  2.6. 

It is clear from the definition of the backward-induction algorithm (Definition 2.3)
that the procedure selects a choice at every decision node and thus yields a strategy
profile for the entire game: the backward-induction solution shown in Panel a of
Figure 2.8 is the strategy profile ((a,g),(c,f)), while the backward-induction
solution shown in Panel b is the strategy profile ((b,h),(c,e)). Both of them are
Nash equilibria of the strategic form, but not all the Nash equilibria correspond to
backward-induction solutions. The relationship between the two concepts will be
explained in the following section.

Remark 2.1. A backward-induction solution is a strategy profile. Since
strategies contain a description of what a player actually does and also of what
the player would do in circumstances that do not arise, one often draws a
distinction between the backward-induction solution and the backward-induction
outcome, which is defined as the sequence of actual moves. For
example, the backward-induction outcome associated with the solution
((a,g),(c,f)) is the play ac with corresponding payoff (2,1), while the
backward-induction outcome associated with the solution ((b,h),(c,e)) is the
play be with corresponding payoff (3,1).

This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this  section,  by  going  through  the  exercises  in  Section  2.E.3  of  Appendix  2.E  at  the 
end  of  this  chapter. 


2.4  Relationship  between  backward 
induction  and  other  solutions 

If  you  have  gone  through  the  exercises  for  the  previous  three  sections,  you  will 
have  seen  that  in  all  those  games  the  backward-induction  solutions  are  also 
Nash  equilibria.  This  is  always  true,  as  stated  in  the  following  theorem. 

Theorem  2.1.  Every  backward-induction  solution  of  a perfect-information 
game  is  a Nash  equilibrium  of  the  associated  strategic  form. 

In  some  games  the  set  of  backward-induction  solutions  coincides  with  the  set  of 
Nash  equilibria  (see,  for  example,  Exercise  2.9),  but  typically  the  set  of  Nash 
equilibria  is  larger  than  (is  a proper  superset  of)  the  set  of  backward-induction 
solutions  (as  shown  for  the  game  of  Figure  2.6,  which  has  two  backward-induction 
solutions,  shown  in  Figure  2.8,  but  five  Nash  equilibria,  shown  in  Figure  2.7).  Nash 
equilibria  that  are  not  backward-induction  solutions  often  involve  incredible  threats. 
To  see  this,  consider  the  following  game.  An  industry  is  currently  a monopoly  and 
the incumbent monopolist is currently making a profit of $5 million. A potential
entrant  is  considering  whether  or  not  to  enter  this  industry.  If  she  does  not  enter, 
she  will  make  $1  million  in  an  alternative  investment;  if  she  does  enter,  then  the 
incumbent  can  either  fight  entry  with  a price  war  whose  outcome  is  that  both  firms 
make  zero  profits,  or  it  can  accommodate  entry,  by  sharing  the  market  with  the 
entrant,  in  which  case  both  firms  make  a profit  of  $2  million.  This  situation  is 
illustrated  in  Figure  2.9  with  the  associated  strategic  form.  Note  that  we  are 
assuming  that  each  player  is  selfish  and  greedy,  that  is,  cares  only  about  its  own 
profit  and  prefers  more  money  to  less. 

Figure 2.9

The  entry  game. 

The  backward-induction  solution  is  (in,  accommodate)  and  it  is  also  a Nash 
equilibrium.  However,  there  is  another  Nash  equilibrium,  namely  (out,  fight).  The 
latter  should  be  discarded  as  a “rational  solution”  because  it  involves  an  incredible 
threat  on  the  part  of  the  incumbent,  namely  that  it  will  fight  entry  if  the  potential 
entrant  enters.  It  is  true  that,  if  the  potential  entrant  believes  the  incumbent’s  threat, 
then  she  is  better  off  staying  out;  however,  she  should  ignore  the  incumbent’s 
threat because she should realize that - when faced with the fait accompli of entry
- the incumbent would not want to carry out the threat.
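
Both claims about the entry game can be verified directly from the profits given in the description. The sketch below lists payoffs as (entrant, incumbent) in millions of dollars, finds the Nash equilibria of the strategic form and then computes the backward-induction solution; the variable names are illustrative.

```python
ENTRANT = ["out", "in"]
INCUMBENT = ["fight", "accommodate"]

# Payoffs (entrant, incumbent) in millions of dollars. If the entrant stays out,
# the incumbent's plan is never carried out and the payoffs do not depend on it.
PAYOFF = {
    ("out", "fight"): (1, 5), ("out", "accommodate"): (1, 5),
    ("in", "fight"): (0, 0), ("in", "accommodate"): (2, 2),
}


def is_nash(e, i):
    """True if neither firm can gain by a unilateral change of strategy."""
    return (all(PAYOFF[(e2, i)][0] <= PAYOFF[(e, i)][0] for e2 in ENTRANT) and
            all(PAYOFF[(e, i2)][1] <= PAYOFF[(e, i)][1] for i2 in INCUMBENT))


nash = [(e, i) for e in ENTRANT for i in INCUMBENT if is_nash(e, i)]

# Backward induction: first the incumbent's best reply to entry, then the
# entrant's choice given that reply.
reply = max(INCUMBENT, key=lambda i: PAYOFF[("in", i)][1])
entry = "in" if PAYOFF[("in", reply)][0] > PAYOFF[("out", reply)][0] else "out"

print(nash)
print((entry, reply))
```

The profile (out, fight) passes the Nash test only because the incumbent's plan to fight is never put to the test when the entrant stays out.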

Reinhard  Selten  (who  shared  the  1994  Nobel  prize  in  economics  with  two  other 
game  theorists,  John  Harsanyi  and  John  Nash)  discussed  a repeated  version  of  the 
above  entry  game,  which  has  become  known  as  Selten’s  Chain  Store  Game.  The 
story is as follows. A chain store is a monopolist in an industry. It owns stores in m
different towns (m ≥ 2). In each town the chain store makes $5 million if left to
enjoy  its  privileged  position  undisturbed.  In  each  town  there  is  a businesswoman 
who  could  enter  the  industry  in  that  town,  but  earns  $1  million  if  she  chooses  not 
to  enter;  if  she  decides  to  enter,  then  the  monopolist  can  either  fight  the  entrant, 
leading  to  zero  profits  for  both  the  chain  store  and  the  entrant  in  that  town,  or  it 
can  accommodate  entry  and  share  the  market  with  the  entrant,  in  which  case  both 
players  make  $2  million  in  that  town.  Thus,  in  each  town  the  interaction  between 
the  incumbent  monopolist  and  the  potential  entrant  is  as  illustrated  in  Figure  2.9 
above.  However,  decisions  are  made  sequentially,  as  follows.  At  date  t (t  = 1, ...,  m) 
the  businesswoman  in  town  t decides  whether  or  not  to  enter  and  if  she  enters  then 
the  chain  store  decides  whether  or  not  to  fight  in  that  town.  What  happens  in  town 
t at  date  t becomes  known  to  everybody.  Thus,  for  example,  the  businesswoman  in 
town  2 at  date  2 knows  what  happened  in  town  1 at  date  1 (either  that  there  was  no 
entry  or  that  entry  was  met  with  a fight  or  that  entry  was  accommodated). 
Intuition  suggests  that  in  this  game  the  threat  by  the  incumbent  to  fight  early 
entrants  might  be  credible,  for  the  following  reason.  The  incumbent  could  tell 
Businesswoman  1 the  following: 

“It  is  true  that,  if  you  enter  and  I fight,  I will  make  zero 
profits,  while  by  accommodating  your  entry  I would  make  $2 
million  and  thus  it  would  seem  that  it  cannot  be  in  my 
interest  to  fight  you.  However,  somebody  else  is  watching  us, 
namely  Businesswoman  2.  If  she  sees  that  I have  fought  your 
entry  then  she  might  fear  that  I would  do  the  same  with  her 
and  decide  to  stay  out,  in  which  case  in  town  2 I would  make 
$5  million,  so  that  my  total  profits  in  towns  1 and  2 would  be 
$(0  + 5)  = $5  million.  On  the  other  hand,  if  I accommodate 
your entry, then she will be encouraged to enter herself and I
will  make  $2  million  in  each  town,  for  a total  profit  of  $4 
million.  Hence  - as  you  can  see  - it  is  indeed  in  my  interest  to 
fight  you  and  thus  you  should  stay  out.” 

Does  the  notion  of  backward  induction  capture  this  intuition?  To  check  this,  let  us 
consider  the  case  where  m = 2,  so  that  the  extensive  game  is  not  too  large  to  draw. 
It  is  shown  in  Figure  2.10,  where  at  each  terminal  node  the  top  number  is  the  profit 
of  the  incumbent  monopolist  (it  is  the  sum  of  the  profits  in  the  two  towns),  the 
middle  number  is  the  profit  of  Businesswoman  1 and  the  bottom  number  is  the 
profit  of  Businesswoman  2.  All  profits  are  expressed  in  millions  of  dollars.  We 
assume  that  all  the  players  are  selfish  and  greedy,  so  that  we  can  take  the  profit  of 
each  player  to  be  that  player’s  payoff. 

The backward-induction solution is unique and is shown by the thick directed
edges in Figure 2.10. The corresponding outcome is that both businesswomen will
enter and the incumbent monopolist accommodates entry in both towns.

Figure 2.10

Selten’s Chain-Store game.

Thus  the  backward-induction  solution  does  not  capture  the  “reputation”  argument 
outlined  above.  However,  the  backward-induction  solution  does  seem  to  capture 
the  notion  of  rational  behavior  in  this  game.  Indeed,  Businesswoman  1 should  reply 
to  the  incumbent  with  the  following  counter-argument: 

“Your reasoning is not valid. Whatever happens in town 1, it
will be common knowledge between you and Businesswoman
2 that your interaction in town 2 will be the last; in particular,
nobody else will be watching and thus there won’t be an issue
of establishing a reputation in the eyes of another player.
Hence in town 2 it will be in your interest to accommodate
entry, since - in essence - you will be playing the one-shot
entry game of Figure 2.9. Hence a rational Businesswoman 2
will decide to enter in town 2 whatever happened in town 1:
what you do against me will have no influence on her decision.
Thus your “reputation” argument does not apply and it will in
fact be in your interest not to fight my entry: your choice will
be between a profit of $(0 + 2) = $2 million, if you fight me,
and a profit of $(2 + 2) = $4 million, if you don’t fight me.
Hence I will enter and you will not fight me.”
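
The counter-argument can be checked mechanically. The following sketch computes the backward-induction outcome of the chain-store game for m towns, using the per-town profits given in the story; it returns the play and the payoffs rather than the full strategy profile, and the function name and the encoding of town results are illustrative assumptions.

```python
# Per-town profits (chain store, entrant) in millions of dollars.
TOWN = {"out": (5, 1), "fight": (0, 0), "accommodate": (2, 2)}


def solve(t, m, history):
    """Backward induction from date t on, given the results in earlier towns.
    Returns (payoffs, play), where payoffs lists the chain store's total profit
    followed by the profit of each businesswoman."""
    if t > m:
        payoffs = [sum(TOWN[r][0] for r in history)] + [TOWN[r][1] for r in history]
        return payoffs, list(history)
    # The chain store's best reply to entry in town t (first maximizer on ties).
    after_entry = max((solve(t + 1, m, history + [r]) for r in ("accommodate", "fight")),
                      key=lambda result: result[0][0])
    stay_out = solve(t + 1, m, history + ["out"])
    # Businesswoman t compares her own profit (position t) from the two options.
    return after_entry if after_entry[0][t] > stay_out[0][t] else stay_out


print(solve(1, 2, []))            # the whole game with two towns
print(solve(2, 2, ["fight"]))     # town 2 after a fight in town 1
```

The second call illustrates the key step of the counter-argument: even after a fight in town 1, Businesswoman 2 enters and is accommodated, so fighting in town 1 yields the chain store a total of 2 rather than 4.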

In  order  to  capture  the  reputation  argument  described  above  we  need  to  allow  for 
some  uncertainty  in  the  mind  of  some  of  the  players,  as  we  will  show  in  a later 
chapter.  In  a perfect-information  game  uncertainty  is  ruled  out  by  definition. 

By  Theorem  2.1  the  notion  of  backward  induction  can  be  seen  as  a refinement  of 
the  notion  of  Nash  equilibrium.  Another  solution  concept  that  is  related  to 
backward  induction  is  the  iterated  elimination  of  weakly  dominated  strategies. 
Indeed  the  backward-induction  algorithm  could  be  viewed  as  a step-wise  procedure 
that  eliminates  dominated  choices  at  decision  nodes,  and  thus  strategies  that  contain 
those  choices.  What  is  the  relationship  between  the  two  notions?  In  general  this  is 
all  that  can  be  said:  applying  the  iterated  deletion  of  weakly  dominated  strategies  to 
the  strategic  form  associated  with  a perfect-information  game  leads  to  a set  of 
strategy  profiles  that  contains  at  least  one  backward-induction  solution;  however, 
(1)  it  may  also  contain  strategy  profiles  that  are  not  backward-induction  solutions 
and  (2)  it  may  fail  to  contain  all  the  backward-induction  solutions,  as  shown  in 
Exercise  2.8. 


2.5 Perfect-information games with two players

We  conclude  this  chapter  with  a discussion  of  finite  two-player  extensive  games 
with  perfect  information. 

We will start with games that have only two outcomes, namely “Player 1 wins”
(denoted by $W_1$) and “Player 2 wins” (denoted by $W_2$). We assume that Player 1
strictly prefers $W_1$ to $W_2$ and Player 2 strictly prefers $W_2$ to $W_1$. Thus we can
use utility functions with values 0 and 1 and associate with each terminal node
either the payoff vector (1,0) (if the outcome is $W_1$) or the payoff vector (0,1) (if
the outcome is $W_2$). We call these games win-lose games. An example of such a
game is the following.

Example 2.2. Two players take turns choosing a number from the set
{1, 2, ..., 10}, with Player 1 moving first. The first player who brings the sum of
all the chosen numbers to 100 or more wins.

The following is one possible play of the game (the players alternate, so the
numbers in odd positions are the ones chosen by Player 1 and the numbers in
even positions the ones chosen by Player 2):

10, 9, 9, 10, 8, 7, 10, 10, 1, 8, 1, 7, 6, 4

In  this  play  Player  2 wins:  at  her  last  move  the  sum  is  96  and  with  her  choice  of  4 
she  brings  the  total  to  100.  However,  in  this  game  Player  1 has  a winning  strategy, 
that  is,  a strategy  that  guarantees  that  he  will  win,  no  matter  what  numbers  Player  2 
chooses.  To  see  this,  we  can  use  backward-induction  reasoning.  Drawing  the  tree  is 
not  a practical  option,  since  the  number  of  nodes  is  very  large:  one  needs  10,000 
nodes just to represent the first 4 moves! But we can imagine drawing the tree,
placing ourselves towards the end of the tree and asking what partial sum represents a
“losing  position”,  in  the  sense  that  whoever  is  choosing  in  that  position  cannot  win, 
while  the  other  player  can  then  win  with  his  subsequent  choice.  With  some 
thought  one  can  see  that  89  is  the  largest  losing  position:  whoever  moves  there  can 
take  the  sum  to  any  number  in  the  set  {90,  91,  ...,  99},  thus  coming  short  of  100, 
while  the  other  player  can  then  take  the  sum  to  100  with  an  appropriate  choice. 
What  is  the  largest  losing  position  that  precedes  89?  The  answer  is  78:  whoever 
moves  at  78  must  take  the  sum  to  a number  in  the  set  {79,  80, ...,  88}  and  then  from 
there  the  other  player  can  make  sure  to  take  the  sum  to  89  and  then  we  know  what 
happens  from  there!  Repeating  this  reasoning  we  see  that  the  losing  positions  are: 
89,  78,  67,  56,  45,  34,  23,  12,  1.  Since  Player  1 moves  first  he  can  choose  1 and  put 
Player  2 in  the  first  losing  position;  then  - whatever  Player  2 chooses  - Player  1 can 
put  her  in  the  next  losing  position,  namely  12,  etc.  Recall  that  a strategy  for  Player  1 
must  specify  what  to  do  in  every  possible  situation  in  which  he  might  find  himself. 
In this game Player 1’s winning strategy is as follows:

Start with the number 1. Then, at every turn, choose
the number (11 - n), where n is the number that was
chosen by Player 2 in the immediately preceding turn.

Here  is  an  example  of  a possible  play  of  the  game  where  Player  1 employs  the 
winning  strategy  and  does  in  fact  win: 

1,  9,  2,  6,  5,  7,  4,  10,  1,  8,  3,  3,  8,  9,  2,  5,  6,  1,  10. 
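
The backward-induction reasoning of Example 2.2 can also be checked mechanically. The following Python sketch is only an illustration: the function names `losing_positions` and `play_with_rule`, and their parameters, are assumptions introduced here and do not come from the text. The first function marks each partial sum as winning or losing for the player about to move, working backwards from the target; the second plays Player 1's rule against a given list of choices by Player 2.

```python
def losing_positions(target=100, max_choice=10):
    """Partial sums at which the player about to move cannot avoid losing."""
    # wins[s] is True when the player about to move at partial sum s can force a win.
    wins = [False] * target
    for s in range(target - 1, -1, -1):
        # The mover wins by reaching the target directly, or by handing
        # the opponent a partial sum from which the opponent cannot win.
        wins[s] = any(
            s + k >= target or not wins[s + k]
            for k in range(1, max_choice + 1)
        )
    return [s for s in range(target) if not wins[s]]


def play_with_rule(opponent_choices, first=1, complement=11, target=100):
    """Player 1 opens with `first`, then answers each choice n with complement - n."""
    total = first
    history = [first]
    for n in opponent_choices:
        total += n
        history.append(n)
        if total >= target:
            return history, 2  # Player 2 reached the target
        reply = complement - n
        total += reply
        history.append(reply)
        if total >= target:
            return history, 1  # Player 1 reached the target
    return history, None  # the supplied choices ran out before the game ended


print(losing_positions())
print(play_with_rule([9, 6, 7, 10, 8, 3, 9, 5, 1]))
```

With the default arguments the first call lists the losing positions of Example 2.2, and the second call replays the sample play shown above. Changing `target` and `max_choice` applies the same reasoning to other games of this type.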

We  can  now  state  a general  result  about  this  class  of  games. 

Theorem  2.2.  In  every  finite  two-player,  win-lose  game  with  perfect 
information  one  of  the  two  players  has  a winning  strategy. 

Although  we  will  not  give  a detailed  proof,  the  argument  of  the  proof  is  rather 
simple.  By  applying  the  backward-induction  algorithm  we  assign  to  every  decision 
node  either  the  payoff  vector  (1,0)  or  the  payoff  vector  (0,1).  Imagine  applying  the 
algorithm  up  to  the  point  where  the  immediate  successors  of  the  root  have  been 
assigned  a payoff  vector.  Two  cases  are  possible. 

Case  1:  at  least  one  of  the  immediate  successors  of  the  root  has  been  assigned  the 
payoff  vector  (1,0).  In  this  case  Player  1 is  the  one  who  has  a winning  strategy  and 
his  initial  choice  should  be  such  that  a node  with  payoff  vector  (1,0)  is  reached  and 
then  his  future  choices  should  also  be  such  that  only  nodes  with  payoff  vector  (1,0) 
are  reached. 

Case  2:  all  the  immediate  successors  of  the  root  have  been  assigned  the  payoff 
vector  (0,1).  In  this  case  it  is  Player  2 who  has  a winning  strategy.  An  example  of  a 
game  where  it  is  Player  2 who  has  a winning  strategy  is  given  in  Exercise  2.11. 
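
The argument behind Theorem 2.2 can be sketched in a few lines of Python. The tree representation below is an assumption made for illustration (a terminal node is the integer 1 or 2, naming the winner; a decision node is a pair consisting of the player who moves and a dictionary from actions to successor nodes), and the example tree is hypothetical rather than taken from the text.

```python
def winner(node):
    """Backward induction: the player who can force a win from this node."""
    if node in (1, 2):
        return node  # terminal node: the winner is given
    player, children = node
    # The mover wins if at least one successor is a win for the mover;
    # otherwise every successor is a win for the opponent.
    if any(winner(child) == player for child in children.values()):
        return player
    return 3 - player


def winning_moves(node):
    """Actions at a decision node that lead to a node won by the mover."""
    player, children = node
    return [a for a, child in children.items() if winner(child) == player]


# Hypothetical win-lose game: Player 1 moves at the root, then Player 2.
tree = (1, {
    "a": (2, {"c": 1, "d": 2}),
    "b": (2, {"e": 1, "f": (1, {"g": 1, "h": 2})}),
})

print(winner(tree), winning_moves(tree))
```

In this hypothetical tree the successor reached by `b` is a win for Player 1 whatever Player 2 does, so the root falls under Case 1.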

We now turn to finite two-player games where there are three possible outcomes:
“Player 1 wins” (W1), “Player 2 wins” (W2) and “Draw” (D). We assume that the
rankings of the outcomes are as follows: W1 ≻_1 D ≻_1 W2 and W2 ≻_2 D ≻_2 W1.
Examples of such games are Tic-Tac-Toe (http://en.wikipedia.org/wiki/Tic-tac-toe),
Draughts or Checkers (http://en.wikipedia.org/wiki/Draughts) and Chess
(although there does not seem to be agreement as to whether the rules of Chess
guarantee that every possible play of the game is finite). What can we say about
such games? The answer is provided by the following theorem.

Theorem 2.3. Every finite two-player, perfect-information game with three
outcomes: Player 1 wins (W1), Player 2 wins (W2) and Draw (D) falls within
one of the following three categories:

(1) Player 1 has a strategy that guarantees outcome W1.

(2) Player 2 has a strategy that guarantees outcome W2.

(3) Player 1 has a strategy that guarantees that the outcome will be W1 or D and
Player 2 has a strategy that guarantees that the outcome will be W2 or D, so
that, if both players employ these strategies, the outcome will be D.

The  logic  of  the  proof  is  as  follows.  By  applying  the  backward-induction  algorithm 
we  assign  to  every  decision  node  either  the  payoff  vector  (2,0)  (corresponding  to 
outcome W1) or the payoff vector (0,2) (corresponding to outcome W2) or the
payoff  vector  (1,1)  (corresponding  to  outcome  D).  Imagine  applying  the  algorithm 
up  to  the  point  where  the  immediate  successors  of  the  root  have  been  assigned  a 
payoff  vector.  Three  cases  are  possible. 

Case  1:  at  least  one  of  the  immediate  successors  of  the  root  has  been  assigned  the 
payoff  vector  (2,0);  in  this  case  Player  1 is  the  one  who  has  a winning  strategy. 

Case  2:  all  the  immediate  successors  of  the  root  have  been  assigned  the  payoff 
vector  (0,2);  in  this  case  it  is  Player  2 who  has  a winning  strategy. 

Case  3:  there  is  at  least  one  immediate  successor  of  the  root  to  which  the  payoff 
vector  (1,1)  has  been  assigned  and  all  the  other  immediate  successors  of  the  root 
have been assigned either (1,1) or (0,2). In this case we fall within the third category
of Theorem 2.3.

Both Tic-Tac-Toe and Draughts fall within the third category
(http://en.wikipedia.org/wiki/Solved_game#Solved_games). As of 2015 it is not
known to which category the game of Chess belongs.
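
For a game as small as Tic-Tac-Toe the classification of Theorem 2.3 can be computed directly. The sketch below is an illustration under stated assumptions: the board encoding (a 9-character string read row by row, with X standing for Player 1 and O for Player 2) and the function names are choices made here, not part of the text. It applies backward induction with the payoff 2, 1 or 0 to Player 1 for the outcomes W1, D and W2, matching the payoff vectors (2,0), (1,1) and (0,2) used in the proof.

```python
from functools import lru_cache

LINES = [(0, 1, 2), (3, 4, 5), (6, 7, 8),
         (0, 3, 6), (1, 4, 7), (2, 5, 8),
         (0, 4, 8), (2, 4, 6)]


def line_winner(board):
    """Return 'X' or 'O' if that player has completed a line, else None."""
    for i, j, k in LINES:
        if board[i] != " " and board[i] == board[j] == board[k]:
            return board[i]
    return None


@lru_cache(maxsize=None)
def value(board, mover):
    """Backward-induction payoff to Player 1 (X): 2 = W1, 1 = D, 0 = W2."""
    won = line_winner(board)
    if won is not None:
        return 2 if won == "X" else 0
    if " " not in board:
        return 1  # full board with no completed line: draw
    other = "O" if mover == "X" else "X"
    results = [
        value(board[:i] + mover + board[i + 1:], other)
        for i, cell in enumerate(board) if cell == " "
    ]
    # X maximises Player 1's payoff; O's payoff is 2 minus it, so O minimises it.
    return max(results) if mover == "X" else min(results)


def category(root_value):
    """Map the value of the root to the category number of Theorem 2.3."""
    return {2: 1, 0: 2, 1: 3}[root_value]


print(category(value(" " * 9, "X")))
```

The memoisation via `lru_cache` is a design choice that avoids solving the same board position more than once; the reasoning is otherwise the plain backward-induction algorithm.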


This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this  section,  by  going  through  the  exercises  in  Section  2.E.5  of  Appendix  2.E  at  the 
end  of  this  chapter. 

Appendix 2.E: Exercises

2.E.1. Exercises for Section 2.1: trees, frames and games

The answers to the following exercises are in Appendix S at the end of this chapter.

Exercise 2.1. How could they do that! They abducted Speedy, your favorite
tortoise!  They  asked  for  $1,000  in  unmarked  bills  and  threatened  to  kill  Speedy 
if  you  don’t  pay.  Call  the  tortoise-napper  Mr.  T.  Let  the  possible  outcomes  be 
as  follows: 

o1: you don’t pay and Speedy is released
o2: you pay $1,000 and Speedy is released
o3: you don’t pay and Speedy is killed
o4: you pay $1,000 and Speedy is killed

You are attached to Speedy and would be willing to pay $1,000 to get it back.
However, you also like your money and you prefer not to pay, conditional on
the two separate events “Speedy is released” and “Speedy is killed”. Thus your
ranking of the outcomes is o1 ≻_you o2 ≻_you o3 ≻_you o4. On the other hand, you
are not quite sure of what Mr. T’s ranking is.

(a)  Suppose  first  that  Mr  T has  communicated  that  he  wants  you  to  go  to 
Central  Park  tomorrow  at  10:00am  and  leave  the  money  in  a garbage  can; 
he  also  said  that,  two  miles  to  the  East  and  at  the  exact  same  time,  he  will 
decide  whether  or  not  to  free  Speedy  in  front  of  the  police  station  and  then 
go  and  collect  his  money  in  Central  Park.  What  should  you  do? 

(b)  Suppose  that  Mr  T is  not  as  dumb  as  in  part  (a)  and  he  instead  gives  you  the 
following  instructions:  first  you  leave  the  money  in  a garbage  can  in  Central 
Park  and  then  he  will  go  there  to  collect  the  money.  He  also  told  you  that  if 
you  left  the  money  there  then  he  will  free  Speedy,  otherwise  he  will  kill  it. 
Draw  an  extensive  form  or  frame  to  represent  this  situation. 

(c)  Now  we  want  to  construct  a game  based  on  the  extensive  form  of  part  (b). 
For  this  we  need  Mr  T’s  preferences.  There  are  two  types  of  criminals  in 
Mr  T’s  line  of  work:  the  professionals  and  the  one-timers.  Professionals  are 
in  the  business  for  the  long  term  and  thus  worry  about  reputation;  they 
want it to be known that (1) every time they were paid they honored their
promise  to  free  the  hostage  and  (2)  their  threats  are  to  be  taken  seriously,  in 
the  sense  that  every  time  they  were  not  paid,  the  hostage  was  killed.  The 
one-timers  hit  once  and  then  they  disappear;  they  don’t  try  to  establish  a 
reputation  and  the  only  thing  they  worry  about,  besides  money,  is  not  to 
be  caught:  whether  or  not  they  get  paid,  they  prefer  to  kill  the  hostage  in 
order to eliminate any kind of evidence (DNA traces, fingerprints, etc.).
Construct  two  games  based  on  the  extensive  form  of  part  (b)  representing 
the  two  possible  types  of  Mr  T. 

Exercise  2.2.  A three-man  board,  composed  of  A,  B,  and  C,  has  held  hearings 
on  a personnel  case  involving  an  officer  of  the  company.  This  officer  was 
scheduled  for  promotion  but,  prior  to  final  action  on  his  promotion,  he  made  a 
decision  that  cost  the  company  a good  deal  of  money.  The  question  is  whether 
he  should  be  (1)  promoted  anyway,  (2)  denied  the  promotion,  or  (3)  fired.  The 
board  has  discussed  the  matter  at  length  and  is  unable  to  reach  unanimous 
agreement.  In  the  course  of  the  discussion  it  has  become  clear  to  all  three  of 
them  that  their  separate  opinions  are  as  follows: 

• A considers  the  officer  to  have  been  a victim  of  bad  luck,  not  bad  judgment, 
and  wants  to  go  ahead  and  promote  him  but,  failing  that,  would  keep  him 
rather  than  fire  him. 

• B considers  the  mistake  serious  enough  to  bar  promotion  altogether;  he’d 
prefer  to  keep  the  officer,  denying  promotion,  but  would  rather  fire  than 
promote  him. 

• C thinks  the  man  ought  to  be  fired  but,  in  terms  of  personnel  policy  and 
morale,  believes  the  man  ought  not  to  be  kept  unless  he  is  promoted,  i.e., 
that  keeping  an  officer  who  has  been  declared  unfit  for  promotion  is  even 
worse  than  promoting  him. 

To recapitulate, their preferences among the three outcomes are

        PROMOTE    KEEP      FIRE
A:      best       middle    worst
B:      worst      best      middle
C:      middle     worst     best

Assume  that  everyone’s  preferences  among  the  three  outcomes  are  fully  evident 
as  a result  of  discussion.  The  three  must  proceed  to  a vote. 

Consider  the  following  voting  procedure.  First  A proposes  an  action  (either 
promote  or  keep  or  fire).  Then  it  is  B’s  turn.  If  B accepts  A’s  proposal,  then  this 
becomes the final decision. If B disagrees with A’s proposal, then C makes the
final decision (which may be any of the three: promote, keep or fire). Represent
this  situation  as  an  extensive  game  with  perfect  information. 

2.E.2. Exercises for Section 2.2: backward induction.

The answers to the following exercises are in Appendix S at the end of this chapter.

Exercise 2.3. Apply the backward-induction algorithm to the two games of
Exercise 2.1, part (c).

Exercise 2.4. Apply the backward-induction algorithm to the game of Exercise 2.2.

2.E.3. Exercises for Section 2.3: strategies in perfect-information games.

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this  chapter. 

Exercise  2.5.  Write  the  strategic  form  of  the  game  of  Figure  2.2,  find  all  the 
Nash  equilibria  and  verify  that  the  backward-induction  solution  is  a Nash 
equilibrium. 

Exercise  2.6.  Write  the  strategic  form  of  the  game  of  Figure  2.3,  find  all  the 
Nash  equilibria  and  verify  that  the  backward-induction  solutions  are  Nash 
equilibria. 

Exercise 2.7. (a) Write down all the strategies of Player B in the game of Exercise 2.2.

(b)  How  many  strategies  does  Player  C have? 

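Exercise 2.8. Consider the following perfect-information game:

[Figure: game tree. Player 1 moves first, choosing L or R. After L, Player 2 chooses a, with payoffs (2,1), or b, with payoffs (4,0). After R, Player 2 chooses c, with payoffs (2,0), d, with payoffs (3,2), or e; after e, Player 1 chooses W, with payoffs (1,2), or E, with payoffs (0,3). In each pair the first number is Player 1's payoff.]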

(a)  Find  the  backward-induction  solutions. 

(b)  Write  down  all  the  strategies  of  Player  1. 

(c)  Write  down  all  the  strategies  of  Player  2. 

(d)  Write  the  strategic  form  associated  with  this  game. 

(e) Does Player 1 have a dominant strategy?

(f) Does Player 2 have a dominant strategy?

(g) Is there a dominant-strategy equilibrium?

(h)  Does  Player  1 have  any  dominated  strategies? 

(i)  Does  Player  2 have  any  dominated  strategies? 

(j)  What  do  you  get  when  you  apply  the  iterative  elimination  of  weakly 
dominated  strategies? 

(k)  What  are  the  Nash  equilibria? 

Exercise  2.9.  Consider  an  industry  where  there  are  two  firms,  a large  firm, 
Firm  1,  and  a small  firm,  Firm  2.  The  two  firms  produce  identical  products.  Let 
x be  the  output  of  Firm  1 and  y the  output  of  Firm  2.  Industry  output  is 
Q = x + y.  The  price  P at  which  each  unit  of  output  can  be  sold  is  determined 
by the inverse demand function P = 130 - 10Q. For example, if Firm 1
produces 4 units and Firm 2 produces 2 units, then industry output is 6 and
each unit is sold for P = 130 - 60 = $70. For each firm the cost of producing q
units of output is C(q) = 10q + 62.5. Each firm is only interested in its own
profits. The profit of Firm 1 depends on both x and y and is given by
Π1(x, y) = x[130 - 10(x + y)] - (10x + 62.5), where the first term is revenue
and the second term is cost, and similarly the profit function of
Firm 2 is given by Π2(x, y) = y[130 - 10(x + y)] - (10y + 62.5). The two firms
play the following sequential game. First Firm 1 chooses its own output x and
commits to it; then Firm 2, after having observed Firm 1’s output, chooses its
own  output  y;  then  the  price  is  determined  according  to  the  demand  function 
and  the  two  firms  collect  their  own  profits.  In  what  follows  assume,  for 
simplicity,  that  x can  only  be  6 or  6.5  units  and  y can  only  be  2.5  or  3 units. 

(a)  Represent  this  situation  as  an  extensive  game  with  perfect  information. 

(b)  Solve  the  game  using  backward  induction. 

(c)  Write  the  strategic  form  associated  with  the  perfect-information  game. 

(d)  Find  the  Nash  equilibria  of  this  game  and  verify  that  the  backward- 
induction  solutions  are  Nash  equilibria. 
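
Exercise 2.10. Consider the following perfect-information game where x is an
integer.

[Figure: game tree. Player 1 chooses A or B. After A, Player 2 chooses C, with payoffs (2,5), or D, with payoffs (1,0). After B, Player 2 chooses E, with payoffs (1,0), or F, with payoffs (x,2). In each pair the first number is Player 1's payoff.]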


(a)  For  every  value  of  x find  the  backward  induction  solution(s). 

(b)  Write  the  corresponding  strategic-form  and  find  all  the  Nash  equilibria. 

2.E.5.  Exercises  for  Section  2.5:  two-player  games. 

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this  chapter. 

Exercise  2.11.  Consider  the  following  perfect-information  game.  Player  1 starts 
by  choosing  a number  from  the  set  {1,2,3,4,5,6,7},  then  Player  2 chooses  a 
number  from  this  set,  then  Player  1 again,  followed  by  Player  2,  etc.  The  first 
player  who  brings  the  cumulative  sum  of  all  the  numbers  chosen  (up  to  and 
including the last one) to 48 or more wins. By Theorem 2.2 one of the two
players  has  a winning  strategy.  Find  out  who  that  player  is  and  fully  describe 
the  winning  strategy. 

Exercise  2.12.  Consider  the  following  two-player,  perfect-information  game.  A 
coin is placed in the cell marked ‘START’ (cell A1). Player 1 moves first and
can move the coin one cell up (to A2) or one cell to the left (to B1) or one cell
diagonally in the left-up direction (to B2). Then Player 2 moves, according to
the same rules (e.g. if the coin is in cell B2 then the admissible moves are shown
by the directed edges). The players alternate moving the coin. Black cells are
not accessible (so that, for example, from A3 the coin can only be moved to A4
or B3 and from F3 it can only be moved to G4, as shown by the directed edge).
The player who manages to place the coin in the cell marked ‘END’ wins.

(a) Represent this game by means of an extensive form with perfect
information by drawing the initial part of the tree that covers the first
two moves (the first move of Player 1 and the first move of Player 2).

(b) Suppose that the coin is currently in cell G4 and it is Player 1’s turn to
move. Show that Player 1 has a strategy that allows her to win the game
starting from cell G4. Describe the strategy in detail.

(c) Describe a play of the game (from cell A1) where Player 1 wins
(describe it by means of the sequence of cells visited by the coin).

(d) Describe a play of the game (from cell A1) where Player 2 wins
(describe it by means of the sequence of cells visited by the coin).

(e) Now go back to the beginning of the game. The coin is in cell A1 and
Player 1 has the first move. By Theorem 2.2 one of the two players has
a winning strategy. Find out who that player is and fully describe the
winning strategy.
Exercise  2.13:  Challenging  Question.  Two  women,  Anna  and  Bess,  claim  to 
be  the  legal  owners  of  a diamond  ring  that  - each  claims  - has  great  sentimental 
value.  Neither  of  them  can  produce  evidence  of  ownership  and  nobody  else  is 
staking  a claim  on  the  ring.  Judge  Sabio  wants  the  ring  to  go  to  the  legal  owner, 
but  he  does  not  know  which  of  the  two  women  is  in  fact  the  legal  owner.  He 
decides  to  proceed  as  follows.  First  he  announces  a fine  of  $F  > 0 and  then  asks 
Anna  and  Bess  to  play  the  following  game. 

Move  1:  Anna  moves  first.  Either  she  gives  up  her  claim  to  the  ring  (in  which 
case  Bess  gets  the  ring,  the  game  ends  and  nobody  pays  the  fine)  or  she  asserts 
her  claim,  in  which  case  the  game  proceeds  to  Move  2. 

Move  2:  Bess  either  accepts  Anna’s  claim  (in  which  case  Anna  gets  the  ring,  the 
game  ends  and  nobody  pays  the  fine)  or  challenges  her  claim.  In  the  latter  case, 
Bess  must  put  in  a bid,  call  it  B,  and  Anna  must  pay  the  fine  of  $F  to  Sabio.  The 
game  goes  on  to  Move  3. 

Move  3:  Anna  now  either  matches  Bess’s  bid  (in  which  case  Anna  gets  the  ring, 
Anna  pays  $B  to  Sabio  - in  addition  to  the  fine  that  she  already  paid  - and 
Bess  pays  the  fine  of  $F  to  Sabio)  or  chooses  not  to  match  (in  which  case  Bess 
gets  the  ring  and  pays  her  bid  of  $B  to  Sabio  and,  furthermore,  Sabio  keeps  the 
fine  that  Anna  already  paid). 

Denote by CA the monetary equivalent of getting the ring for Anna (that is,
getting the ring is as good, in Anna’s mind, as getting $CA) and by CB the
monetary equivalent of getting the ring for Bess. Not getting the ring is
considered by both as good as getting zero dollars.

(a) Draw an extensive game with perfect information to represent the above
situation, assuming that there are only two possible bids: B1 and B2. Write
the payoffs to Anna and Bess next to each terminal node.

(b) Find the backward-induction solution of the game you drew in part (a) for
the case where B1 > CA > CB > B2 > F > 0.

Now consider the general case where the bid B can be any non-negative
number and assume that both Anna and Bess are very wealthy. Assume also
that CA, CB and F are positive numbers and that CA and CB are common
knowledge between Anna and Bess. We want to show that, at the
backward-induction solution of the game, the ring always goes to the legal owner. Since
we (like Sabio) don’t know who the legal owner is, we must consider two cases.

Case 1: the legal owner is Anna. Let us assume that this implies that CA > CB.

Case 2: the legal owner is Bess. Let us assume that this implies that CB > CA.

(c)  Find  the  backward-induction  solution  for  Case  1 and  show  that  it  implies 
that  the  ring  goes  to  Anna. 

(d)  Find  the  backward-induction  solution  for  Case  2 and  show  that  it  implies 
that  the  ring  goes  to  Bess. 

(e)  How  much  money  does  Sabio  make  in  equilibrium?  How  much  money  do 
Anna and Bess end up paying in equilibrium? (By ‘equilibrium’ we mean
‘backward induction solution’.)

Appendix 2.S: Solutions to exercises

Exercise  2.1.  (a)  For  you  it  is  a strictly  dominant  strategy  to  not  pay  and  thus 
you  should  not  pay. 

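(b) The extensive form is as follows:

[Figure: extensive form. You move first, choosing between not pay and pay; Mr T then chooses between release and kill. The outcomes are o1 (not pay, release), o3 (not pay, kill), o2 (pay, release) and o4 (pay, kill).]

(c) For the professional, concern with reputation implies that o2 ≻_MrT o4 and
o3 ≻_MrT o1. If we add the reasonable assumption that - after all - money is
what they are after, then we can take the full ranking to be
o2 ≻_MrT o4 ≻_MrT o3 ≻_MrT o1. Representing preferences with ordinal utility
functions with values in the set {1, 2, 3, 4}, we have

outcome:    o1   o2   o3   o4
U_you        4    3    2    1
U_MrT        1    4    2    3

The corresponding game is as follows:

[Figure: game tree with the same structure as in part (b). Payoffs (you, Mr T): not pay and release (4,1); not pay and kill (2,2); pay and release (3,4); pay and kill (1,3).]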

For the one-timer, the ranking can be taken to be (although this is not the only
possibility) o4 ≻_MrT o2 ≻_MrT o3 ≻_MrT o1, with corresponding utility representation:

outcome:    o1   o2   o3   o4
U_you        4    3    2    1
U_MrT        1    3    2    4

and extensive game:

[Figure: game tree with the same structure as in part (b). Payoffs (you, Mr T): not pay and release (4,1); not pay and kill (2,2); pay and release (3,3); pay and kill (1,4).]


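Exercise 2.2.

The game is as follows (“P” stands for promote, “K” for keep (without
promoting), “F” for fire):

[Figure: game tree. A moves first, proposing P, K or F. B then accepts, which makes A's proposal the final decision, or rejects, in which case C chooses P, K or F. With utility 3 for a player's best outcome, 2 for the middle one and 1 for the worst, the payoffs (A, B, C) are (3,1,2) for P, (2,3,1) for K and (1,2,3) for F.]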
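
Exercise 2.3. The application of the backward-induction algorithm is shown
by double edges in the following games; the first refers to the professional Mr.
T and the second to the one-timer.

[Figure: the game for the professional Mr T. Double edges: Mr T kills after not pay and releases after pay; you pay.]

[Figure: the game for the one-timer Mr T. Double edges: Mr T kills after not pay and kills after pay; you do not pay.]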

Thus,  against  a professional  you  would  pay  and  against  a one-timer  you  would 
not  pay.  With  the  professional  you  would  get  Speedy  back,  with  the  one-timer 
you  will  hold  a memorial  service  for  Speedy. 

Exercise 2.4. The backward-induction algorithm yields two solutions, shown
below.  The  difference  between  the  two  solutions  lies  in  what  Player  B would 
do  if  Player  A proposed  F.  In  both  solutions  the  officer  is  kept  without 
promotion. 
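
The two solutions can be reproduced with a short backward-induction computation. The Python sketch below is an illustration under stated assumptions: the ordinal utilities (3 for best, 2 for middle, 1 for worst) encode the preference table of Exercise 2.2, the function names are chosen here, and C is assumed to make the same choice at each of her three decision nodes, which is without loss of generality because her most preferred outcome is unique.

```python
from itertools import product

# Ordinal utilities (3 = best, 1 = worst) from the preference table of Exercise 2.2.
UTILITY = {
    "A": {"P": 3, "K": 2, "F": 1},
    "B": {"P": 1, "K": 3, "F": 2},
    "C": {"P": 2, "K": 1, "F": 3},
}
OUTCOMES = ["P", "K", "F"]


def best(player, options):
    """Labels in `options` (label -> final outcome) that are optimal for `player`."""
    top = max(UTILITY[player][o] for o in options.values())
    return [label for label, o in options.items() if UTILITY[player][o] == top]


def backward_induction():
    """Solutions as tuples (A's proposal, B's plan, C's choice, final outcome)."""
    solutions = []
    # Last stage: C makes the final decision after any rejection.
    for c_choice in best("C", {o: o for o in OUTCOMES}):
        # Middle stage: for each proposal, B compares accepting with rejecting.
        b_options = [
            best("B", {"accept": proposal, "reject": c_choice})
            for proposal in OUTCOMES
        ]
        # Ties for B generate several plans, hence several solutions.
        for b_plan in product(*b_options):
            result = {
                proposal: proposal if reply == "accept" else c_choice
                for proposal, reply in zip(OUTCOMES, b_plan)
            }
            # First stage: A proposes whatever leads to A's best final outcome.
            for proposal in best("A", result):
                solutions.append((proposal, b_plan, c_choice, result[proposal]))
    return solutions


for solution in backward_induction():
    print(solution)
```

Each plan of B lists her replies to the proposals P, K and F in that order. Keeping all tied choices, rather than picking one arbitrarily, is what produces both solutions.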
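
Exercise 2.5. The game of Figure 2.2 is reproduced below, with the unique
backward-induction solution marked by double edges:

[Figure: the game of Figure 2.2. Player 1 offers 50-50 or 70-30 and Player 2 then accepts or rejects. Payoffs (Player 1, Player 2): 50-50 accepted (2,3); 50-50 rejected (1,2); 70-30 accepted (3,1); 70-30 rejected (1,2).]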

The corresponding strategic form is as follows (for each of Player 2’s strategies,
the first element in the pair is what Player 2 would do at her left node and the
second element what she would do at her right node). The Nash equilibria are
marked with an asterisk; in each cell the first number is Player 1’s payoff. One
Nash equilibrium, namely (Offer 50-50, (Accept, Reject)),
corresponds to the backward induction solution, while the other Nash
equilibrium, namely (Offer 70-30, (Reject, Reject)) does not correspond to a
backward-induction solution.

                                          Player 2
Player 1       (Accept, Accept)  (Accept, Reject)  (Reject, Accept)  (Reject, Reject)
offer 50-50         2, 3              2, 3 *            1, 2              1, 2
offer 70-30         3, 1              1, 2              3, 1              1, 2 *

Exercise  2.6.  The  game  of  Figure  2.3  is  reproduced  below  with  the  two 
backward-induction  solutions  marked  by  double  edges. 

The corresponding strategic form is as follows. The Nash equilibria are
marked with an asterisk; in each cell the payoffs are listed in the order Player 1,
Player 2, Player 3. The backward-induction solutions are (a,(c,f),g) and (b,(c,e),h)
and both of them are Nash equilibria. There are three more Nash equilibria
which are not backward-induction solutions, namely (b,(d,f),g), (a,(c,f),h)
and (b,(d,e),h).

Player 3: g
                            Player 2
Player 1      ce           cf           de           df
a           2, 1, 0      2, 1, 0 *    0, 0, 2      0, 0, 2
b           3, 1, 0      1, 2, 1      3, 1, 0      1, 2, 1 *

Player 3: h
                            Player 2
Player 1      ce           cf           de           df
a           2, 1, 0      2, 1, 0 *    0, 0, 2      0, 0, 2
b           3, 1, 0 *    0, 0, 1      3, 1, 0 *    0, 0, 1


Exercise 2.7. The game of Exercise 2.2 is reproduced below:

[Figure: game tree of Exercise 2.2, in which A proposes P, K or F, B accepts or rejects, and after a rejection C chooses P, K or F.]

(a) All the possible strategies of Player B are shown in the following table:

      If A chooses P   If A chooses K   If A chooses F
1     accept           accept           accept
2     accept           accept           reject
3     accept           reject           accept
4     accept           reject           reject
5     reject           accept           accept
6     reject           accept           reject
7     reject           reject           accept
8     reject           reject           reject

(b) Player C has three decision nodes and three choices at each of her nodes.
Thus she has 3 × 3 × 3 = 27 strategies.


Exercise  2.8. 

(a) One backward-induction solution is the strategy profile ((L,W),(a,e))
shown in the following figure by doubling the corresponding edges. The
corresponding backward-induction outcome is the play La with associated
payoff vector (2,1).

[Figure: the game tree with double edges on L, W, a and e.]

The other backward-induction solution is the strategy profile
((R,W),(a,d)) shown in the following figure. The corresponding
backward-induction outcome is the play Rd with associated payoff vector
(3,2).

[Figure: the game tree with double edges on R, W, a and d.]

(b) Player 1 has four strategies: LW, LE, RW, RE.

(c) Player 2 has six strategies: ac, ad, ae, bc, bd, be.

(d) The strategic form is as follows (in each cell the first number is Player 1’s payoff):

                              Player 2
Player 1     ac      ad      ae      bc      bd      be
LW          2, 1    2, 1    2, 1    4, 0    4, 0    4, 0
LE          2, 1    2, 1    2, 1    4, 0    4, 0    4, 0
RW          2, 0    3, 2    1, 2    2, 0    3, 2    1, 2
RE          2, 0    3, 2    0, 3    2, 0    3, 2    0, 3

(e)  Player  1 does  not  have  a dominant  strategy. 

(f)  For  Player  2 ae  is  a weakly  dominant  strategy. 

(g)  There  is  no  dominant  strategy  equilibrium. 

(h) For Player 1 RE is weakly dominated by RW (and LW and LE are
equivalent).

(i) For Player 2 ac is weakly dominated by ad (or ae), ad is weakly
dominated by ae, bc is (strictly or weakly) dominated by every other
strategy, bd is weakly dominated by be (and by ae and ad), be is weakly
dominated by ae. Thus the dominated strategies are: ac, ad, bc, bd and be.

(j) The iterative elimination of weakly dominated strategies yields the
following reduced game (in Step 1 eliminate RE for Player 1 and ac, ad,
bc, bd and be for Player 2; in Step 2 eliminate RW for Player 1):

          ae
LW       2, 1
LE       2, 1

Thus we are left with one of the two backward-induction solutions,
namely ((L,W),(a,e)) but also with ((L,E),(a,e)) which is not a
backward-induction solution.

(k) The Nash equilibria are marked with an asterisk in the following table
(in each cell the first number is Player 1’s payoff). There are
five Nash equilibria: (LW,ac), (LE,ac), (RW,ad), (LW,ae) and (LE,ae).

                               Player 2
Player 1     ac        ad        ae        bc      bd      be
LW          2, 1 *    2, 1      2, 1 *    4, 0    4, 0    4, 0
LE          2, 1 *    2, 1      2, 1 *    4, 0    4, 0    4, 0
RW          2, 0      3, 2 *    1, 2      2, 0    3, 2    1, 2
RE          2, 0      3, 2      0, 3      2, 0    3, 2    0, 3
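
The answers to parts (i) and (k) of Exercise 2.8 can be cross-checked by brute force. The Python sketch below is an illustration only: the function names are assumptions introduced here, and `payoff` simply encodes the payoffs of the strategic form of part (d) by following each pair of strategies through the tree.

```python
from itertools import product

ROWS = ["LW", "LE", "RW", "RE"]               # Player 1: choice at the root, then W or E
COLS = ["ac", "ad", "ae", "bc", "bd", "be"]   # Player 2: reply to L, reply to R


def payoff(row, col):
    """Payoff pair (Player 1, Player 2) reached by the two strategies."""
    if row[0] == "L":
        return (2, 1) if col[0] == "a" else (4, 0)
    if col[1] == "c":
        return (2, 0)
    if col[1] == "d":
        return (3, 2)
    return (1, 2) if row[1] == "W" else (0, 3)


def nash_equilibria():
    """Profiles at which neither player gains from a unilateral deviation."""
    found = []
    for r, c in product(ROWS, COLS):
        u1, u2 = payoff(r, c)
        row_ok = all(payoff(r2, c)[0] <= u1 for r2 in ROWS)
        col_ok = all(payoff(r, c2)[1] <= u2 for c2 in COLS)
        if row_ok and col_ok:
            found.append((r, c))
    return found


def weakly_dominated_columns():
    """Strategies of Player 2 that are weakly dominated by another of her strategies."""
    def dominates(c2, c):
        diffs = [payoff(r, c2)[1] - payoff(r, c)[1] for r in ROWS]
        return min(diffs) >= 0 and max(diffs) > 0
    return [c for c in COLS if any(dominates(c2, c) for c2 in COLS)]


print(nash_equilibria())
print(weakly_dominated_columns())
```

The same two functions apply to any two-player strategic form once `ROWS`, `COLS` and `payoff` are replaced.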

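Exercise 2.9. (a) The extensive game is as follows:

[Figure: game tree. Firm 1 chooses x = 6 or x = 6.5; after observing x, Firm 2 chooses y = 2.5 or y = 3. Profits (Firm 1, Firm 2): x = 6, y = 2.5 gives (147.5, 25); x = 6, y = 3 gives (117.5, 27.5); x = 6.5, y = 2.5 gives (132.5, 12.5); x = 6.5, y = 3 gives (100, 12.5).]

(b) There are two backward-induction solutions. The first is the strategy profile
(6, (3, 3)) shown in the following figure. The corresponding
backward-induction outcome is given by Firm 1 producing 6 units and Firm 2
producing 3 units with profits 117.5 for Firm 1 and 27.5 for Firm 2.

[Figure: the game tree with the backward-induction solution (6, (3, 3)) marked.]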


The other backward-induction solution is the strategy profile (6.5, (3, 2.5))
shown in the following figure. The corresponding backward-induction
outcome is given by Firm 1 producing 6.5 units and Firm 2 producing 2.5
units with profits 132.5 for Firm 1 and 12.5 for Firm 2.

[Figure: the game tree with the backward-induction solution (6.5, (3, 2.5)) marked.]
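
The profits and the two backward-induction solutions can be recomputed from the demand and cost functions given in Exercise 2.9. The Python sketch below is an illustration; the function names are assumptions introduced here. A strategy of Firm 2 is written as a pair whose first element is its output after x = 6 and whose second element is its output after x = 6.5.

```python
from itertools import product

X_CHOICES = [6, 6.5]    # admissible outputs of Firm 1
Y_CHOICES = [2.5, 3]    # admissible outputs of Firm 2


def price(x, y):
    """Inverse demand P = 130 - 10Q with Q = x + y."""
    return 130 - 10 * (x + y)


def cost(q):
    """Cost of producing q units: C(q) = 10q + 62.5."""
    return 10 * q + 62.5


def profits(x, y):
    """Profit pair (Firm 1, Firm 2): revenue minus cost for each firm."""
    p = price(x, y)
    return (x * p - cost(x), y * p - cost(y))


def backward_induction():
    """All backward-induction solutions as (x, (y after 6, y after 6.5))."""
    # Second stage: Firm 2's optimal replies to each x, keeping ties.
    replies = {}
    for x in X_CHOICES:
        top = max(profits(x, y)[1] for y in Y_CHOICES)
        replies[x] = [y for y in Y_CHOICES if profits(x, y)[1] == top]
    solutions = []
    for y6, y65 in product(replies[6], replies[6.5]):
        plan = {6: y6, 6.5: y65}
        # First stage: Firm 1 anticipates Firm 2's plan.
        top = max(profits(x, plan[x])[0] for x in X_CHOICES)
        for x in X_CHOICES:
            if profits(x, plan[x])[0] == top:
                solutions.append((x, (y6, y65)))
    return solutions


print(backward_induction())
```

Firm 2 is indifferent between its two outputs after x = 6.5, and it is this tie that gives rise to two solutions.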


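(c) The strategic form is as follows. In each cell the first number is Firm 1’s
profit and the second is Firm 2’s profit; each strategy of Firm 2 is a pair whose
first element is its output if x = 6 and whose second element is its output if
x = 6.5. The Nash equilibria are marked with an asterisk.

                                   Firm 2
Firm 1     (2.5, 2.5)      (2.5, 3)        (3, 2.5)          (3, 3)
6          147.5, 25       147.5, 25       117.5, 27.5       117.5, 27.5 *
6.5        132.5, 12.5     100, 12.5       132.5, 12.5 *     100, 12.5

(d) The Nash equilibria are marked in the above table. In this game the set
of Nash equilibria coincides with the set of backward-induction solutions.

Exercise 2.10. The game under consideration is the following, where x is an
integer:

[Figure: game tree. Player 1 chooses A or B. After A, Player 2 chooses C, with payoffs (2,5), or D, with payoffs (1,0). After B, Player 2 chooses E, with payoffs (1,0), or F, with payoffs (x,2).]

(a) The backward-induction strategy of Player 2 is the same, no matter what x
is, namely (C,F). Thus the backward induction solutions are as follows.

• If x < 2, there is only one: (A, (C,F)).

• If x = 2 there are two: (A, (C,F)) and (B, (C,F)).

• If x > 2, there is only one: (B, (C,F)).

(b) The strategic form is as follows (in each cell the first number is Player 1’s payoff):

                      Player 2
Player 1     CE      CF      DE      DF
A           2, 5    2, 5    1, 0    1, 0
B           1, 0    x, 2    1, 0    x, 2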

First  note  that  {A,  ( C,E ))  is  a Nash  equilibrium  for  every  value  of  x. 
Now,  depending  on  the  value  of  x the  other  Nash  equilibria  are  as  follows: 

• IfxCl  :(A,(C,F)). 
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• If  1 < x < 2 , (A,  (| C,F ))  and  ( B , ( D,F )). 

• If  x = 2,  (H,  (C,F)),  (5,  (C,T))  and  (B,  (D,F)). 

• If  x > 2,  (5,  (C,Fj)  and  (B,  ( D,F )). 
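
The case analysis for Exercise 2.10 can be checked mechanically. The following is a minimal sketch, not part of the original solution: it hard-codes the payoffs of the strategic form of part (b) (first number Player 1, second number Player 2) and lists the pure-strategy Nash equilibria for a given x. The function name nash_equilibria is a hypothetical helper.

```python
def nash_equilibria(x):
    """Pure-strategy Nash equilibria of the Exercise 2.10 strategic form."""
    rows = ("A", "B")
    cols = ("CE", "CF", "DE", "DF")
    # Payoffs (Player 1, Player 2) as read from the strategic form of part (b).
    u = {
        ("A", "CE"): (2, 5), ("A", "CF"): (2, 5),
        ("A", "DE"): (1, 0), ("A", "DF"): (1, 0),
        ("B", "CE"): (1, 0), ("B", "CF"): (x, 2),
        ("B", "DE"): (1, 0), ("B", "DF"): (x, 2),
    }
    equilibria = []
    for r in rows:
        for c in cols:
            # Neither player can gain by a unilateral deviation.
            row_ok = all(u[(r, c)][0] >= u[(r2, c)][0] for r2 in rows)
            col_ok = all(u[(r, c)][1] >= u[(r, c2)][1] for c2 in cols)
            if row_ok and col_ok:
                equilibria.append((r, c))
    return equilibria


for x in (0, 1, 2, 3):
    print(x, nash_equilibria(x))
```

Running it for x = 0, 1, 2, 3 covers the four cases listed above.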


Exercise 2.11. Let us find the losing positions. If player i, with his choice, can
bring the sum to 40 then he can win (the other player with her next choice will
take the sum to a number between 41 and 47 and then player i can win with his
next choice). Thus 40 is a losing position for the player who has to move from it.
Working backwards, the previous losing position is 32 (from here
the player who has to move will take the sum to a number between 33 and 39
and after this the opponent can take it to 40). Reasoning backwards, the earlier
losing positions are 24, 16, 8 and 0. Thus Player 1 starts from a losing position
and therefore it is Player 2 who has a winning strategy. The winning strategy is:
at every turn, if Player 1's last choice was n then Player 2 should choose (8 - n).
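
The backward reasoning above can be reproduced with a short computation. This is only a sketch under assumptions inferred from the solution, since the statement of the exercise is not reproduced here: players alternately add an integer between 1 and 7 to a running sum, and the player who brings the sum to 48 wins. The names losing_positions, target and max_choice are illustrative.

```python
def losing_positions(target=48, max_choice=7):
    """Sums from which the player who has to move loses against best play."""
    # win[s] is True if the player about to move at sum s can force a win.
    win = [False] * (target + 1)
    for s in range(target - 1, -1, -1):
        win[s] = any(
            s + k == target or (s + k < target and not win[s + k])
            for k in range(1, max_choice + 1)
        )
    return [s for s in range(target) if not win[s]]


print(losing_positions())
```

Under these assumptions the losing positions are exactly the multiples of 8 below the target, which is why answering a choice of n with 8 - n keeps the opponent on them.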

Exercise  2.12.  (a)  The  initial  part  of  the  game  is  shown  below. 


(b)  From  G4  Player  1 should  move  the  coin  to  H5.  From  there  Player  2 has  to 
move  it  to  H6  and  Player  1 to  H 7 and  Player  2 to  H8  and  from  there  Player 
1 wins  by  moving  it  to  H9. 

(c) A1 → B2 → C3 → D4 → E5 → F5 → G6 → H7 → H8 → H9

1 2 1 2 1 2 1 2 1 2

(d) A1 → B2 → C3 → D4 → E5 → F5 → G6 → G7 → H7 → H8 → H9

(e)  Using  backward  induction  we  can  label  each  cell  with  a W (meaning  that 
the  player  who  has  to  move  when  the  coin  is  there  has  a winning 
continuation  strategy)  or  with  an  L (meaning  that  the  player  who  has  to 
move  when  the  coin  is  there  can  be  made  to  lose).  If  all  the  cells  that  are 
accessible  from  a given  cell  are  marked  with  a W then  that  cell  must  be 
marked  with  an  L.  If  from  a cell  there  is  an  accessible  cell  marked  with  an  L 
then that cell should be marked with a W. See the following picture.

From the picture it is clear that it is Player 1 who has a winning strategy.
The winning strategy of Player 1 is: move the coin to cell B1 and from then
on, after every move of Player 2, move the coin to a cell marked L.
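
The labeling rule of part (e) is easy to automate. The sketch below rests on assumptions inferred from the moves that appear in the solution, because the board and the rules are not reproduced here: columns A to H, rows 1 to 9, a move shifts the coin one cell to the right, one cell up, or one cell diagonally up and to the right, and the player who moves the coin to H9 wins. The helpers parse, moves and label are hypothetical names.

```python
from functools import lru_cache

COLS = "ABCDEFGH"  # assumed columns A..H
ROWS = 9           # assumed rows 1..9; the coin must reach H9


def parse(name):
    """Turn a cell name such as 'G4' into zero-based (column, row)."""
    return COLS.index(name[0]), int(name[1:]) - 1


def moves(cell):
    """Assumed legal moves: one step right, one step up, or one diagonal step."""
    c, r = cell
    steps = [(c + 1, r), (c, r + 1), (c + 1, r + 1)]
    return [(x, y) for x, y in steps if x < len(COLS) and y < ROWS]


@lru_cache(maxsize=None)
def label(cell):
    """'W' if the player who must move from this cell can force a win, else 'L'."""
    # A cell is W if some accessible cell is L, and L if all accessible cells
    # are W. H9 has no accessible cells, so the player "to move" there has lost.
    return "W" if any(label(nxt) == "L" for nxt in moves(cell)) else "L"


for name in ("A1", "B1", "G4", "H5"):
    print(name, label(parse(name)))
```

Under these assumptions the labels agree with the solution: the starting cell A1 is W, B1 is L, and from G4 the move to H5 hands the opponent a cell marked L.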


Exercise 2.13. (a) The game is as follows:

[Figure: extensive-form game that begins with Anna choosing between "give up"
and "assert claim".]

(b) The backward-induction solution is marked by thick arrows in the above
figure.

(c) The structure of the game is as follows:

[Figure: Anna first chooses between "give up", with payoffs 0 for Anna and C_B
for Bess, and "assert claim". Bess then chooses between "accept", with payoffs
C_A for Anna and 0 for Bess, and challenging with a bid B. Finally Anna chooses
between "match" and "don't match".]

Suppose that Anna is the legal owner and values the ring more than Bess does:
C_A > C_B. At the last node Anna will choose “match” if C_A > B and “don’t
match” if B > C_A. In the first case Bess’s payoff will be -F, while in the second
case it will be C_B - B, which is negative since B > C_A and C_A > C_B. Thus in
either case Bess’s payoff would be negative. Hence at her decision node Bess
will choose “accept” (Bess can get the ring at this stage only if she bids more
than the ring is worth to her). Anticipating this, Anna will assert her claim at
the first decision node. Thus at the backward-induction solution the ring goes
to Anna, the legal owner. The payoffs are C_A for Anna and 0 for Bess. Note
that no money changes hands.

(d) Suppose that Bess is the legal owner and values the ring more than Anna
does: C_B > C_A. At the last node Anna will choose “match” if C_A > B and
“don’t match” if B > C_A. In the first case Bess’s payoff will be -F, while in
the second case it will be C_B - B, which will be positive as long as C_B > B.
Hence at her decision node Bess will choose to challenge and bid any
amount B such that C_B > B > C_A. Anticipating this, at her first decision
node Anna will give up (and get a payoff of 0), because if she asserted her
claim then her final payoff would be -F. Thus at the backward-induction
solution the ring goes to Bess, the legal owner. The payoffs are 0 for Anna
and C_B for Bess. Note that no money changes hands.

(e) As pointed out above, in both cases no money changes hands at the
backward-induction solution. Thus Sabio collects no money at all and both
Anna and Bess pay nothing.

General dynamic games

3.1  Imperfect  information 

There  are  many  situations  where  players  have  to  make  decisions  with  only 
partial  information  about  previous  moves  by  other  players.  Here  is  an 
example  from  my  professional  experience:  in  order  to  discourage  copying 
and  cheating  in  exams,  I prepare  two  versions  of  the  exam,  print  one  version  on 
white  paper  and  the  other  on  pink  paper  and  distribute  the  exams  in  such  a way 
that  if  a student  gets,  say,  the  white  version  then  the  students  on  his  left  and  right 
have  the  pink  version.  For  simplicity  let  us  assume  that  there  is  only  one  question 
in  the  exam.  What  matters  for  my  purpose  is  not  that  the  question  is  indeed 
different  in  the  two  versions,  but  rather  that  the  students  believe  that  they  are 
different  and  thus  refrain  from  copying  from  their  neighbors.  The  students, 
however,  are  not  naive  and  realize  that  I might  be  bluffing;  indeed,  introducing 
differences  between  the  two  versions  of  the  exam  involves  extra  effort  on  my  part. 
Consider  a student  who  finds  himself  in  the  embarrassing  situation  of  not  having 
studied  for  the  final  exam  and  is  tempted  to  copy  from  his  neighbor,  whom  he 
knows  to  be  a very  good  student.  Let  us  assume  that,  if  he  does  not  copy,  then  he 
turns  in  a blank  exam;  in  this  case,  because  of  his  earlier  grades  in  the  quarter,  he 
will  get  a C;  on  the  other  hand,  if  he  copies  he  will  get  an  A if  the  two  versions  are 
identical  but  will  be  caught  cheating  and  get  an  F if  the  two  versions  are  slightly 
different.  How  can  we  represent  such  a situation?  Clearly  this  is  a situation  in 
which  decisions  are  made  sequentially:  first  the  Professor  decides  whether  to  write 
identical  versions  (albeit  printed  on  different-color  paper)  or  different  versions  and 
then  the  Student  chooses  between  copying  and  leaving  the  exam  blank.  We  can 
easily  represent  this  situation  using  a tree  as  we  did  with  the  case  of  perfect- 
information  games,  but  the  crucial  element  here  is  the  fact  that  the  Student  does  not 
know  whether  the  two  versions  are  identical  or  different.  In  order  to  represent  this 
uncertainty  (or  lack  of  information)  in  the  mind  of  the  Student,  we  use  the  notion 
of  information  set.  An  information  set  for  a player  is  a collection  of  decision  nodes 
of  that  player  and  the  interpretation  is  that  the  player  does  not  know  at  which  of 
these nodes he is making his decision. Graphically, we represent an information set
by enclosing the corresponding nodes in a rounded rectangle. Figure 3.1 represents
the situation described above.


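[Figure: the PROFESSOR moves first, choosing between identical versions and
different versions. The STUDENT then chooses between copying and leaving the
exam blank; his two decision nodes are enclosed in a single information set.
Outcomes: identical versions and copy, Student gets A; identical versions and
blank, Student gets C; different versions and copy, Student caught cheating,
gets F; different versions and blank, Student gets C.]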

Figure  3.1 

An  extensive  form,  or  frame,  with  imperfect  information 

As  usual  we  need  to  distinguish  between  a game-frame  and  a game.  Figure  3.1 
depicts  a game-frame:  in  order  to  obtain  a game  from  it  we  need  to  add  a ranking  of 
the  outcomes  for  each  player.  For  the  moment  we  shall  ignore  payoffs  and  focus  on 
frames. A game-frame such as the one shown in Figure 3.1 is called an extensive form
(or frame) with imperfect information; in this example it is the Student who has
imperfect information, or uncertainty, about the earlier decision of the Professor.

We  now  give  a general  definition  of  extensive  form  that  allows  for  perfect 
information  (Definition  2.1,  Chapter  2)  as  a special  case.  The  first  four  items  of 
Definition  3.1,  marked  by  the  bullet  symbol  •,  coincide  with  Definition  2.1;  what  is 
new  is  the  additional  item  marked  by  the  symbol  ♦ . 

First some additional terminology and notation. Given a directed tree and two
nodes x and y we say that y is a successor of x or x is a predecessor of y if there is a
sequence of directed edges from x to y (if the sequence consists of a single directed
edge then we say that y is an immediate successor of x or x is the immediate predecessor
of y). A partition of a set H is a collection $\mathcal{H} = \{H_1, \dots, H_m\}$ ($m \geq 1$) of non-empty
subsets of H such that (1) any two elements of $\mathcal{H}$ are disjoint (if $H_j, H_k \in \mathcal{H}$
with $j \neq k$ then $H_j \cap H_k = \emptyset$) and (2) together the elements of $\mathcal{H}$ cover H:
$H_1 \cup \dots \cup H_m = H$.
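
The two conditions in the definition of a partition translate directly into a check. The following is a minimal sketch; the function name is_partition is a hypothetical helper, not notation from the text.

```python
def is_partition(blocks, H):
    """Return True if `blocks` is a partition of the set H."""
    blocks = [set(b) for b in blocks]
    # A partition has at least one block and no empty block.
    if not blocks or any(not b for b in blocks):
        return False
    union = set().union(*blocks)
    # (1) Pairwise disjoint: no element is counted in two blocks.
    disjoint = sum(len(b) for b in blocks) == len(union)
    # (2) Together the blocks cover H.
    return disjoint and union == set(H)


print(is_partition([{1, 2}, {3}], {1, 2, 3}))
print(is_partition([{1, 2}, {2, 3}], {1, 2, 3}))
```

The first call satisfies both conditions; the second fails condition (1) because the two blocks share the element 2.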
Definition 3.1. A finite extensive form (or frame) with perfect recall consists of the
following items.

• A finite rooted directed tree.

• A set of players $I = \{1, \dots, n\}$ and a function that assigns one player to every
decision node.

• A set of actions A and a function that assigns one action to every directed edge,
satisfying the restriction that no two edges out of the same node are assigned the
same action.

• A set of outcomes O and a function that assigns an outcome to every terminal
node.

♦ For every player $i \in I$, a partition $\mathcal{D}_i$ of the set $D_i$ of decision nodes assigned to
player i (thus $\mathcal{D}_i$ is a collection of mutually disjoint subsets of $D_i$ whose union
is equal to $D_i$). Each element of $\mathcal{D}_i$ is called an information set of player i. The
elements of $\mathcal{D}_i$ satisfy the following restrictions:

(1) the actions available at any two nodes in the same information set must be
the same (that is, for every $D \in \mathcal{D}_i$, if $x, y \in D$ then the outdegree of x is
equal to the outdegree of y and the set of actions assigned to the directed
edges out of x is equal to the set of actions assigned to the directed edges out
of y),

(2) if x and y are two nodes in the same information set then it is not the case
that one node is a predecessor of the other,

(3) each player has perfect recall in the sense that if node $x \in D \in \mathcal{D}_i$ is a
predecessor of node $y \in D' \in \mathcal{D}_i$ (thus, by (2), $D \neq D'$), and a is the action
assigned to the directed edge out of x in the sequence of edges leading from x
to y, then for every node $z \in D'$ there is a predecessor $w \in D$ such that the
action assigned to the directed edge out of w in the sequence of edges leading
from w to z is that same action a.
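
Restrictions (1) and (2) can be checked on a concrete frame. The sketch below is an illustration under assumptions, not part of the definition: a frame is encoded as a dictionary mapping each decision node to its outgoing actions, the node and action names for the exam example are invented, and the functions successors, same_actions and no_predecessor_inside are hypothetical helpers. Restriction (3), perfect recall, is not checked here.

```python
def successors(children, x):
    """All nodes reachable from x by a sequence of directed edges."""
    found, stack = set(), list(children.get(x, {}).values())
    while stack:
        node = stack.pop()
        if node not in found:
            found.add(node)
            stack.extend(children.get(node, {}).values())
    return found


def same_actions(children, info_sets):
    """Restriction (1): nodes in one information set offer the same actions."""
    return all(
        len({frozenset(children[x]) for x in D}) == 1
        for sets in info_sets.values() for D in sets
    )


def no_predecessor_inside(children, info_sets):
    """Restriction (2): no node of an information set precedes another one."""
    return all(
        not (successors(children, x) & D)
        for sets in info_sets.values() for D in sets for x in D
    )


# The exam example: children[node] maps each action to the node it leads to.
exam = {
    "root": {"identical": "s1", "different": "s2"},
    "s1": {"copy": "gets A", "blank": "gets C (identical)"},
    "s2": {"copy": "gets F", "blank": "gets C (different)"},
}
exam_sets = {"Professor": [{"root"}], "Student": [{"s1", "s2"}]}
print(same_actions(exam, exam_sets), no_predecessor_inside(exam, exam_sets))
```

Keeping the information sets separate from the tree mirrors the definition, where the partition is an additional item layered on top of the perfect-information ingredients.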

The  perfect-recall  restriction  says  that  if  a player  takes  action  a at  an  information  set 
and  later  on  has  to  move  again,  then  at  the  later  time  she  remembers  that  she  took 
action  a at  that  earlier  information  set  (because  every  node  she  is  uncertain  about  at 
the  later  time  comes  after  taking  action  a at  that  information  set).  Perfect  recall  can 
be  interpreted  as  requiring  that  a player  always  remember  what  she  knew  in  the  past 
and  what  actions  she  herself  took  in  the  past.  Figure  3.2  shows  two  examples  of 
violation of perfect recall. In the frame shown in Panel (i) Player 1 first chooses
between a and b and then chooses between c and d having forgotten his previous
choice: he does not remember what he chose previously. In the frame shown in
Panel (ii), when Player 2 has to choose between e and f she is uncertain whether this
is the first time she moves (left node) or the second time (right node): she is
uncertain whether she moved in the past.


Figure  3.2 

Examples  of  violations  of  perfect  recall 


If  every  information  set  of  every  player  consists  of  a single  node,  then  the  frame  is 
said to be a perfect-information frame: it is easy to verify that, in this case, the last
item  of  Definition  3.1  (marked  by  the  symbol  ♦)  is  trivially  satisfied  and  thus 
Definition  3.1  coincides  with  Definition  2.1  (Chapter  2).  Otherwise  (that  is,  if  at 
least  one  player  has  at  least  one  information  set  that  consists  of  at  least  two  nodes), 
the  frame  is  said  to  have  imperfect  information.  An  example  of  an  extensive  frame 
with  imperfect  information  is  the  one  shown  in  Figure  3.1.  We  now  give  two  more 
examples.  In  order  to  simplify  the  figures,  when  representing  an  extensive  frame  we 
enclose  an  information  set  in  a rounded  rectangle  if  and  only  if  that  information  set 
contains  at  least  two  nodes. 

Example  3.1.  There  are  three  players,  Ann,  Bob  and  Carla.  Initially,  Ann  and  Bob 
are  in  the  same  room  and  Carla  is  outside  the  room.  Ann  moves  first,  chooses 
either  a red  card  or  a black  card  from  a full  deck  of  cards,  shows  it  to  Bob  and  puts 
it,  face  down,  on  the  table.  Now  Carla  enters  the  room  and  Bob  makes  a statement 
to  Carla:  he  either  says  “Ann  chose  a Red  card”  or  he  says  “Ann  chose  a Black 
card”;  thus  Bob  could  be  lying  or  could  be  telling  the  truth.  After  hearing  Bob’s 
statement  Carla  guesses  the  color  of  the  card  that  was  picked  by  Ann.  The  card  is 
then  turned  and  if  Carla’s  guess  was  correct  then  Ann  and  Bob  give  $1  each  to 
Carla, otherwise Carla gives $1 to each of Ann and Bob. When drawing an
extensive frame to represent this situation, it is important to be careful about what
Carla  knows,  when  she  makes  her  guess,  and  what  she  is  uncertain  about.  The 
extensive  frame  is  shown  in  Figure  3.3. 


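[Figure: ANN chooses a Red or a Black card; BOB, who observes her choice, says
Red or Black; CARLA, who hears only Bob's statement, guesses Red or Black.
Carla gains $2 if her guess is correct and loses $2 otherwise.]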

Figure  3.3 

The  extensive  form,  or  frame,  representing  Example  3.1. 

Carla’s  top  information  set  captures  the  situation  she  is  in  after  hearing  Bob  say 
“Ann  chose  a red  card”  and  not  knowing  if  he  is  telling  the  truth  (left  node)  or  he  is 
lying  (right  node).  Carla’s  bottom  information  set  captures  the  alternative  situation 
where  she  hears  Bob  say  “Ann  chose  a black  card”  and  does  not  know  if  he  is  lying 
(left  node)  or  telling  the  truth  (right  node).  In  both  situations  Carla  knows 
something,  namely  what  Bob  tells  her,  but  lacks  information  about  something  else, 
namely  what  Ann  chose.  The  fact  that  Bob  knows  the  color  of  the  card  chosen  by 
Ann  is  captured  by  giving  Bob  two  information  sets,  each  consisting  of  a single 
node:  Bob’s  left  node  represents  the  situation  he  is  in  when  he  sees  that  Ann  picked 
a red  card,  while  his  right  node  represents  the  situation  he  is  in  when  he  sees  that 
Ann picked a black card.

Example 3.2. Yvonne and Fran were both interviewed for the same job, but only
one person can be hired. The employer told each candidate: “don’t call me, I will
call  you  if  I want  to  offer  you  the  job”.  He  also  told  them  that  he  desperately  needs 
to  fill  the  position  and  thus,  if  turned  down  by  one  candidate,  he  will  automatically 
make  the  offer  to  the  other  candidate,  without  revealing  whether  he  is  making  a 
first  offer  or  a “recycled”  offer.  This  situation  is  represented  in  the  extensive  frame 
shown  in  Figure  3.4. 


Figure  3.4 

The  extensive  form,  or  frame,  representing  Example  3.2. 


As  before,  in  order  to  obtain  a game  from  an  extensive  frame  all  we  need  to  do  is 
add  a ranking  of  the  outcomes  for  each  player.  As  usual,  the  best  way  to  represent 
such  rankings  is  by  means  of  an  ordinal  utility  function  for  each  player  and  thus 
represent  an  extensive-form  game  by  associating  a vector  of  utilities  with  each 
terminal  node.  For  instance,  expanding  on  Example  3.2,  suppose  that  the  employer 
only  cares  about  whether  the  position  is  filled  or  not,  prefers  filling  the  position  to 
not  filling  it,  but  is  indifferent  between  filling  it  with  Yvonne  or  with  Fran;  thus  we 
can  assign  a utility  of  1 for  the  employer  to  every  outcome  where  one  of  the  two 
candidates  accepts  the  offer  and  a utility  of  0 to  every  other  outcome.  Yvonne’s 
favorite  outcome  is  to  be  hired  if  she  was  the  recipient  of  the  first  call  by  the 
employer; her second best outcome is not to be hired and her worst outcome is to
accept a recycled offer (in the latter case Fran would have a blast telling Yvonne
“You  took  that  job?!  It  was  offered  to  me  but  I turned  it  down.  Who,  in  her  right 
mind,  would  want  that  job?  What’s  wrong  with  you?!”).  Thus  for  Yvonne  we  can 
use  utilities  of  2 (if  she  accepts  a first  offer),  1 (if  she  is  not  hired)  and  0 (if  she  accepts 
a recycled  offer).  Finally,  suppose  that  Fran  has  preferences  similar  (but  symmetric) 
to  Yvonne’s.  Then  the  extensive  frame  of  Figure  3.4  gives  rise  to  the  extensive  game 
shown  in  Figure  3.5. 

[Figure: the extensive form of Figure 3.4 with a vector of utilities attached to
each terminal node, listed in the order Employer's utility, Yvonne's utility,
Fran's utility.]

Figure  3.5 

A game  based  on  the  extensive  form  of  Figure  3.4. 


This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this  section,  by  going  through  the  exercises  in  Section  3.E.1  of  Appendix  3.E  at  the 
end of this chapter.

3.2. Strategies

The  notion  of  strategy  for  general  extensive  games  is  the  same  as  before:  a strategy 
for  Player  i is  a complete,  contingent  plan  that  covers  all  the  possible  situations 
Player  i might  find  herself  in.  In  the  case  of  a perfect-information  game  a “possible 
situation”  for  a player  is  a decision  node  of  that  player;  in  the  general  case,  where 
there  may  be  imperfect  information,  a “possible  situation”  for  a player  is  an 
information  set  of  that  player.  The  following  definition  reduces  to  Definition  2.4 
(Chapter  2)  if  the  game  is  a perfect-information  game  (where  each  information  set 
consists  of  a single  node). 

Definition  3.2.  A strategy  for  a player  in  an  extensive-form  game  is  a list  of 
choices,  one  for  every  information  set  of  that  player. 

For  example,  in  the  game  of  Figure  3.5,  Yvonne  has  only  one  information  set  and 
thus  a strategy  for  her  is  what  to  do  at  that  information  set,  namely  either  say  Yes 
or  say  No.  Yvonne  cannot  make  the  plan  “if  the  employer  calls  me  first  I will  say 
Yes  and  if  he  calls  me  second  I will  say  No”,  because  when  she  receives  the  call  she 
is  not  told  if  this  is  a first  call  or  a recycled  call  and  thus  she  cannot  make  her 
decision  depend  on  information  she  does  not  have. 

As  in  the  case  of  perfect-information  games,  the  notion  of  strategy  allows  us  to 
associate  with  every  extensive-form  game  a strategic-form  game.  For  example,  the 
strategic  form  associated  with  the  game  of  Figure  3.5  is  shown  in  Table  3.6  with  the 
Nash  equilibria  highlighted. 


Yvonne 

Yes  No 


Table  3.6 

The  strategic  form  of  the  game  of  Figure  3.5 
with  the  Nash  equilibria  highlighted. 
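
The strategic form can also be generated from the verbal description of the game. The sketch below is an illustration under assumptions rather than a reproduction of Table 3.6: the utilities follow the description given before Figure 3.5 (1 for the employer if the position is filled and 0 otherwise; for each candidate 2 for accepting a first offer, 1 for not being hired and 0 for accepting a recycled offer), and the names utilities, STRATEGIES and nash_equilibria are hypothetical.

```python
from itertools import product


def utilities(first, yvonne, fran):
    """(Employer, Yvonne, Fran) utilities; `first` is who gets the first call."""
    answers = {"Yvonne": yvonne, "Fran": fran}
    second = "Fran" if first == "Yvonne" else "Yvonne"
    if answers[first] == "Yes":
        hired, recycled = first, False
    elif answers[second] == "Yes":
        hired, recycled = second, True
    else:
        hired, recycled = None, False

    def candidate_utility(name):
        if hired != name:
            return 1                  # not hired
        return 0 if recycled else 2   # recycled offer vs. first offer

    employer = 1 if hired else 0
    return employer, candidate_utility("Yvonne"), candidate_utility("Fran")


# One strategy set per player: Employer, Yvonne, Fran.
STRATEGIES = [("Yvonne", "Fran"), ("Yes", "No"), ("Yes", "No")]


def nash_equilibria():
    """Pure-strategy profiles from which no player gains by deviating alone."""
    result = []
    for profile in product(*STRATEGIES):
        stable = True
        for i, options in enumerate(STRATEGIES):
            for alt in options:
                deviation = profile[:i] + (alt,) + profile[i + 1:]
                if utilities(*deviation)[i] > utilities(*profile)[i]:
                    stable = False
        if stable:
            result.append(profile)
    return result


print(nash_equilibria())
```

Each candidate has a single strategy component because she has a single information set, which is exactly the point made about Yvonne above.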
As another example, consider the extensive form of Figure 3.3 and view it as a game
by assuming that each player is selfish and greedy (only cares about how much
money he/she gets and prefers more money to less). Then the associated strategic
form is shown in Table 3.7, where Bob’s strategy (x,y) means “I say x if Ann chose
a black  card  and  I say  y if  Ann  chose  a red  card”.  Thus  (R,B)  means  “if  Ann  chose  a 
black  card  I say  Red  and  if  Ann  chose  a red  card  I say  Black”  (that  is,  Bob  plans  to 
lie  in  both  cases).  Similarly,  Carla’s  strategy  (x,y)  means  “I  guess  x if  Bob  tells  me 
Black  and  I guess  y if  Bob  tells  me  Red”.  Thus  (B,R)  means  “if  Bob  tells  me  Black  I 
guess  Black  and  if  Bob  tells  me  Red  I guess  Red”  (that  is,  Carla  plans  to  repeat  what 
Bob  says). 


In each cell the payoffs are listed in the order Ann, Bob, Carla.

CARLA: B,B
                              BOB
              B,B         R,R         B,R         R,B
ANN    B    -1,-1,2     -1,-1,2     -1,-1,2     -1,-1,2
       R     1,1,-2      1,1,-2      1,1,-2      1,1,-2

CARLA: R,R
                              BOB
              B,B         R,R         B,R         R,B
ANN    B     1,1,-2      1,1,-2      1,1,-2      1,1,-2
       R    -1,-1,2     -1,-1,2     -1,-1,2     -1,-1,2

CARLA: B,R
                              BOB
              B,B         R,R         B,R         R,B
ANN    B    -1,-1,2      1,1,-2     -1,-1,2      1,1,-2
       R     1,1,-2     -1,-1,2     -1,-1,2      1,1,-2

CARLA: R,B
                              BOB
              B,B         R,R         B,R         R,B
ANN    B     1,1,-2     -1,-1,2      1,1,-2     -1,-1,2
       R    -1,-1,2      1,1,-2      1,1,-2     -1,-1,2


Table  3.7 

The  strategic  form  of  the  game  of  Figure  3.3. 
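
The entries of Table 3.7 follow mechanically from the conventions for Bob's and Carla's strategies stated above. The sketch below regenerates them; it is an illustration only, and the names COLORS, payoff and table are hypothetical.

```python
from itertools import product

COLORS = ("B", "R")


def payoff(ann, bob, carla):
    """Payoffs (Ann, Bob, Carla) for one strategy profile.

    bob   = (statement if Ann chose Black, statement if Ann chose Red)
    carla = (guess if Bob says Black, guess if Bob says Red)
    """
    said = bob[COLORS.index(ann)]
    guess = carla[COLORS.index(said)]
    # A correct guess costs Ann and Bob $1 each; otherwise Carla pays $1 to each.
    return (-1, -1, 2) if guess == ann else (1, 1, -2)


pairs = list(product(COLORS, repeat=2))
table = {(a, b, c): payoff(a, b, c) for a in COLORS for b in pairs for c in pairs}

# Example: Ann picks Black, Bob lies in both cases, Carla repeats what Bob says.
print(table[("B", ("R", "B"), ("B", "R"))])
```

Writing Bob's and Carla's strategies as pairs indexed by what they observe makes explicit that each of them has two information sets.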

In  order  to  “solve”  an  extensive-form  game  we  could  simply  construct  the 
associated  strategic-form  game  and  look  for  the  Nash  equilibria.  However,  we  saw 
in  Chapter  2 that  in  the  case  of  perfect-information  games  not  all  Nash  equilibria  of 
the  associated  strategic  form  can  be  considered  “rational  solutions”  and  we 
introduced  the  notion  of  backward  induction  to  select  the  “reasonable”  Nash 
equilibria.  What  we  now  need  is  a generalization  of  the  notion  of  backward 
induction  that  can  be  applied  to  general  extensive-form  games.  This  generalization 
is called subgame-perfect equilibrium. First we need to define the notion of subgame.

This is a good time to test your understanding of the concepts introduced in
this  section,  by  going  through  the  exercises  in  Section  3.E.2  of  Appendix  3.E  at  the 
end  of  this  chapter. 


3.3.  Subgames 

Roughly  speaking,  a subgame  of  an  extensive-form  game  is  a portion  of  the  game 
that  could  be  a game  in  itself.  What  we  need  to  be  precise  about  is  the  meaning  of 
“portion  of  the  game”. 

Definition  3.3.  A proper  subgame  of  an  extensive-form  game  is  obtained  as 
follows: 

(1)  start  from  a decision  node  x,  different  from  the  root,  whose  information  set 
consists  of  node  x only  and  enclose  in  an  oval  node  x itself  and  all  its 
successors, 

(2)  if  the  oval  does  not  “cut”  any  information  sets  (that  is,  there  is  no 
information  set  S and  two  nodes  y,z  e S such  that  y is  a successor  of  x 
while  z is  not)  then  what  is  included  in  the  oval  is  a proper  subgame, 
otherwise  it  is  not. 
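
The two conditions of Definition 3.3 can be turned into a small search for the nodes at which a proper subgame starts. The sketch below is an illustration under assumptions: the tree is a made-up example (it is not the game of Figure 3.8), and the names descendants and proper_subgame_roots are hypothetical.

```python
def descendants(children, x):
    """All successors of node x."""
    seen, stack = set(), list(children.get(x, []))
    while stack:
        node = stack.pop()
        if node not in seen:
            seen.add(node)
            stack.extend(children.get(node, []))
    return seen


def proper_subgame_roots(children, info_sets, root):
    """Decision nodes at which a proper subgame starts (Definition 3.3)."""
    roots = []
    for S in info_sets:
        # Condition (1): a singleton information set, different from the root.
        if len(S) != 1:
            continue
        (x,) = S
        if x == root:
            continue
        inside = descendants(children, x) | {x}
        # Condition (2): every information set is entirely inside or outside.
        if all(T <= inside or not (T & inside) for T in info_sets):
            roots.append(x)
    return sorted(roots)


# Hypothetical tree: nodes named t1..t9 are terminal.
children = {
    "r": ["x", "y", "w"],
    "x": ["t1", "t2"],
    "y": ["v1", "z"],
    "w": ["w1", "t3"],
    "v1": ["t4", "t5"],
    "w1": ["t6", "t7"],
    "z": ["t8", "t9"],
}
info_sets = [{"r"}, {"x"}, {"y"}, {"w"}, {"z"}, {"v1", "w1"}]
print(proper_subgame_roots(children, info_sets, "r"))
```

In this made-up tree the information set {v1, w1} is cut both by the portion that starts at y and by the one that starts at w, so only x and z qualify.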

The  reason  why  we  use  the  qualifier  ‘proper’  is  that  one  could  start  from  the  root, 
in  which  case  one  would  end  up  taking  the  entire  game  and  consider  this  as  a 
(trivial)  subgame  (just  like  any  set  is  a subset  of  itself;  a proper  subgame  is  analogous 
to  a proper  subset). 

Consider,  for  example,  the  extensive-form  game  of  Figure  3.8  below.  There  are 
three possible starting points for identifying a proper subgame: nodes x, y and z (the
other nodes fail to satisfy condition (1) of Definition 3.3).

• Starting  from  node  x and  including  all  of  its  successors,  we  do  indeed  obtain 
a proper  subgame,  which  is  the  portion  included  in  the  blue  oval  on  the  left. 

• Starting  from  node  y and  including  all  of  its  successors  we  obtain  the 
portion  of  the  game  that  is  included  in  the  red  oval  on  the  right;  in  this  case, 
condition  (2)  of  Definition  3.3  is  violated,  since  we  are  cutting  the  top 
information  set  of  Player  3;  hence  the  portion  of  the  game  inside  the  red 
oval  is  not  a proper  subgame. 

• Finally, starting from node z and including all of its successors, we do obtain
a proper subgame, which is the portion included in the purple oval at the
bottom.

Thus the game of Figure 3.8 has two proper subgames.


Figure  3.8 

An  extensive-form  game  and  its  proper  subgames. 

Definition  3.4.  A proper  subgame  of  an  extensive-form  game  is  called 
minimal  if  it  does  not  strictly  contain  another  proper  subgame  (that  is,  if  there 
is  no  other  proper  subgame  which  is  contained  in  it  and  does  not  coincide  with 
it). 

For  example,  the  game  shown  in  Figure  3.9  below  has  three  proper  subgames,  one 
starting  at  node  x,  another  at  node  y and  the  third  at  node  z.  The  ones  starting  at 
nodes  x and  z are  minimal  subgames,  while  the  one  that  starts  at  node  y is  not  a 
minimal subgame, since it contains the one that starts at node z.


Figure  3.9 

An  extensive-form  game  with  three  proper  subgames,  two  of  which  are  minimal. 


This is a good time to test your understanding of the concepts introduced in
this  section,  by  going  through  the  exercises  in  Section  3.E.3  of  Appendix  3.E  at  the 
end of this chapter.

3.4. Subgame-perfect equilibrium


A subgame-perfect equilibrium of an extensive-form game is a Nash equilibrium of
the entire game which remains an equilibrium in every proper subgame. Consider
an extensive-form game and let s be a strategy profile for that game. Let G be a
proper subgame. Then the restriction of s to G, denoted by $s|_G$, is that part of s
which prescribes choices at every information set of G and only at those
information sets. For example, consider the extensive-form game of Figure 3.9
above and the strategy profile $s = ((a,C),(d,f,E),(h,B))$, where $(a,C)$ is Player 1's
strategy, $(d,f,E)$ is Player 2's strategy and $(h,B)$ is Player 3's strategy. Let G be the
subgame that starts at node y of Player 2. Then $s|_G = (C,(f,E),B)$, where C is
Player 1's strategy in G, $(f,E)$ is Player 2's strategy in G and B is Player 3's strategy
in G.


Definition  3.5.  Given  an  extensive-form  game,  let  s be  a strategy  profile  for 
the  entire  game.  Then  s is  a subgame-perfect  equilibrium  if 

(1)  s is  a Nash  equilibrium  of  the  entire  game  and 

(2)  for  every  proper  subgame  G,  s |G  (the  restriction  of  s to  G)  is  a Nash 
equilibrium  of  G. 

For example, consider again the extensive-form game of Figure 3.9 and the strategy
profile s = ((a,C),(d,f,E),(h,B)). Then s is a Nash equilibrium of the entire
game: Player 1’s payoff is 2 and, if he were to switch to any strategy where he plays
b, his payoff would be 0; Player 2’s payoff is 1 and if she were to switch to any
strategy where she plays c her payoff would be 0; Player 3’s payoff is 2 and if he
were to switch to any strategy where he plays g his payoff would be 0. However, s
is not a subgame-perfect equilibrium, because the restriction of s to the proper
subgame that starts at node z of Player 1, namely (C,E), is not a Nash equilibrium
of that subgame: in that subgame, for Player 2 the unique best reply to C is F.

One  way  of  finding  the  subgame-perfect  equilibria  of  a given  game  is  to  first  find 
the  Nash  equilibria  and  then,  for  each  of  them,  check  if  it  satisfies  condition  (2)  of 
Definition  3.5.  However,  this  is  not  a practical  way  to  proceed.  A quicker  and 
easier way is to apply the following algorithm, which generalizes the backward-
induction algorithm for games with perfect information (Definition 2.3, Chapter 2).

Definition 3.6. Given an extensive-form game, the subgame-perfect equilibrium
algorithm is the following procedure.

1.  Start  with  a minimal  proper  subgame  and  select  a Nash  equilibrium  of  it. 

2.  Delete  the  selected  proper  subgame  and  replace  it  with  the  payoff  vector 
associated  with  the  selected  Nash  equilibrium,  making  a note  of  the  strategies 
that  constitute  the  Nash  equilibrium.  This  yields  a smaller  extensive-form 
game. 

3.  Repeat  Steps  1 and  2 in  the  smaller  game  so  obtained. 

For  example,  let  us  apply  the  algorithm  to  the  game  of  Figure  3.9.  Begin  with  the 
proper  subgame  that  starts  at  node  x of  Player  2,  shown  below  with  its  associated 
strategic form, where the unique Nash equilibrium (d, h) is highlighted. Note that
this  is  a game  only  between  Players  2 and  3 and  thus  in  Figure  3.10  we  only  show 
the  payoffs  of  these  two  players. 


Figure  3.10 

A minimal  proper  subgame  of  the  game  of  Figure  3.9  and  its  strategic  form. 


Now  we  delete  the  proper  subgame,  thereby  turning  node  x into  a terminal  node 
to which we attach the full payoff vector associated, in the original game, with the
terminal node following history adh, namely (2,1,2). Hence we obtain the smaller
game shown in Figure 3.11.



Figure  3.11 

The  reduced  game  after  replacing  a proper  minimal  subgame 
in  the  game  of  Figure  3.9. 


Now,  in  the  reduced  game  of  Figure  3.11  we  select  the  only  minimal  proper 
subgame,  namely  the  one  that  starts  at  the  bottom  decision  node  of  Player  1.  This 
subgame  is  shown  in  Figure  3.12  together  with  its  associated  strategic  form.  The 
unique Nash equilibrium of that subgame is (C, F).

Figure 3.12

A minimal  proper  subgame  of  the  game  of  Figure  3.11  and  its  strategic  form. 

Then,  in  the  reduced  game  of  Figure  3.11,  we  replace  the  selected  proper  subgame 
with  the  payoff  vector  associated  with  the  history  beACF,  namely  (1,1,1),  thus 
obtaining  the  smaller  game  shown  in  Figure  3.13. 

Figure 3.13

The reduced game after replacing a proper minimal subgame in the game of Figure 3.11.

The game of Figure 3.13 has a unique proper subgame, which has a unique Nash
equilibrium, namely (f, A). Replacing the subgame with the payoff vector
associated with the history bfA we get the following smaller game.

Figure 3.14

The reduced game after replacing the subgame in the game of Figure 3.13.


In the reduced game of Figure 3.14 the unique Nash equilibrium is b. Now patching
together the choices selected during the application of the algorithm we get the
following subgame-perfect equilibrium for the game of Figure 3.9:
((b,C), (d,f,F), (h,A)),
where (b,C) is Player 1’s strategy, (d,f,F) is Player 2’s and (h,A) is Player 3’s.


As  a second  example,  consider  the  game  of  Figure  3.15,  which  reproduces  the  game 
of Figure 3.8.

Figure 3.15

Reproduction of the game of Figure 3.8.


Begin  with  the  subgame  that  starts  at  node  x and  replace  it  with  the  payoff  vector 
(3,1,2).  Next  replace  the  subgame  that  starts  at  node  2 with  the  payoff  vector  (3,2,1) 
which  corresponds  to  the  Nash  equilibrium  (e,h)  of  that  subgame,  so  that  the  game 
is reduced to the one shown in Figure 3.16, together with its strategic form.

Figure 3.16

The game of Figure 3.15 reduced after solving the proper subgames and the associated
strategic form with the Nash equilibria highlighted.

The reduced game of Figure 3.16 has two Nash equilibria: (L,D,a) and (R,U,b). Thus
the game of Figure 3.15 has two subgame-perfect equilibria:

((L,c), (D,e), (a,h)) and ((R,c), (U,e), (b,h)),

where in each profile the first pair is Player 1’s strategy, the second is Player 2’s
and the third is Player 3’s.

Remark  3.1.  As  shown  in  the  last  example,  it  is  possible  that  - when  applying 
the  subgame-perfect  equilibrium  algorithm  - one  encounters  a proper  subgame 
or  a reduced  game  that  has  several  Nash  equilibria.  In  this  case  one  Nash 
equilibrium  must  be  selected  to  continue  the  procedure  and  in  the  end  one 
obtains  one  subgame-perfect  equilibrium.  One  then  has  to  repeat  the  procedure 
by  selecting  a different  Nash  equilibrium  and  thus  obtain  a different  subgame- 
perfect  equilibrium,  and  so  on.  This  is  similar  to  what  happens  with  the 
backward-induction  algorithm  in  perfect-information  games. 

Remark 3.2. It is also possible that - when applying the subgame-perfect
equilibrium algorithm - one encounters a proper subgame or a reduced game
that has no Nash equilibria.¹² In such a case the game under consideration does
not have any subgame-perfect equilibria.

Remark  3.3.  When  applied  to  perfect-information  games,  the  notion  of 
subgame-perfect  equilibrium  coincides  with  the  notion  of  backward-induction 
solution.  Thus  subgame-perfect  equilibrium  is  a generalization  of  backward 
induction. 

Remark  3.4.  For  extensive-form  games  that  have  no  proper  subgames  (for 
example,  the  game  of  Figure  3.3)  the  set  of  Nash  equilibria  coincides  with  the 
set  of  subgame-perfect  equilibria.  In  general,  however,  the  notion  of  subgame- 
perfect  equilibrium  is  a refinement  of  the  notion  of  Nash  equilibrium. 


This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this  section,  by  going  through  the  exercises  in  Section  3.E.4  of  Appendix  3.E  at  the 
end  of  this  chapter. 


12  We  will  see  in  Part  II  that,  when  payoffs  are  cardinal  and  one  allows  for  mixed  strategies,  then 
every  finite  game  has  at  least  one  Nash  equilibrium  in  mixed  strategies. 
3.5. Games with chance moves

So  far  we  have  only  considered  games  where  the  outcomes  do  not  involve  any 
uncertainty.  As  a way  of  introducing  the  topic  discussed  in  Part  II,  in  this  section 
we  consider  games  where  uncertain,  probabilistic  events  are  incorporated  in  the 
extensive  form. 

We  begin  with  an  example.  There  are  three  cards,  one  black  and  two  red.  They  are 
shuffled  well  and  put  face  down  on  the  table.  Adele  picks  the  top  card,  looks  at  it 
without  showing  it  to  Ben  and  then  tells  Ben  either  “the  top  card  is  black”  or  “the 
top  card  is  red”:  she  could  be  telling  the  truth  or  she  could  be  lying.  Ben  then  has  to 
guess  the  true  color  of  the  top  card.  If  he  guesses  correctly  he  gets  $9  from  Adele, 
otherwise he gives her $9. How can we represent this situation? Whether the top
card is black or red is not the outcome of a player’s decision, but the outcome of a
random event, namely the shuffling of the cards. In order to capture this random
event we introduce a fictitious player called Nature or Chance. We assign a
probability distribution to Nature’s “choices”. In this case, since one card is black
and the other two are red, the probability that the top card is black is 1/3 and the
probability that the top card is red is 2/3. Note that we don’t assign payoffs to
Nature  and  thus  the  only  real  players  are  Adele  and  Ben.  The  situation  can  be 
represented  as  shown  in  Figure  3.17,  where  the  numbers  associated  with  the 
terminal  nodes  are  dollar  amounts. 

Figure 3.17

An extensive form with a chance move.
Clearly, the notion of strategy is not affected by the presence of chance moves. In
the game of Figure 3.17 Adele has four strategies and so does Ben. However, we do
encounter a difficulty when we try to write the associated strategic form. For
example, consider the strategy profile ((B,R), (B,B)), where Adele’s
strategy is to be truthful (say Black if she sees a black card and Red if she sees a red
card) and Ben’s strategy is to guess Black no matter what Adele says. What is the
outcome in this case? It depends on what the true color of the top card is and thus
the outcome is a probabilistic one:


\[
\begin{pmatrix}
\text{outcome} & \text{Adele gives \$9 to Ben} & \text{Ben gives \$9 to Adele} \\
\text{probability} & \frac{1}{3} & \frac{2}{3}
\end{pmatrix}
\]


We  call  such  probabilistic  outcomes  lotteries.  In  order  to  convert  the  game-frame 
into  a game  we  need  to  specify  how  the  players  rank  probabilistic  outcomes. 
Consider  the  case  where  Adele  is  selfish  and  greedy,  in  the  sense  that  she  only  cares 
about  her  own  wealth  and  she  prefers  more  money  to  less.  Then,  from  her  point  of 
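view, the above probabilistic outcome reduces to the following monetary lottery

\[ \begin{pmatrix} -\$9 & \$9 \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix}. \]

If Ben is also selfish and greedy, then he views the same outcome as the lottery

\[ \begin{pmatrix} \$9 & -\$9 \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix}. \]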


How  do  we  convert  a lottery  into  a payoff  or  utility?  The  general  answer  to  this 
question  will  be  provided  in  Chapter  4.  Here  we  consider  one  possibility,  which  is 
particularly  simple. 


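Definition 3.7. Given a lottery whose outcomes are sums of money

\[
\begin{pmatrix} \$x_1 & \$x_2 & \dots & \$x_n \\ p_1 & p_2 & \dots & p_n \end{pmatrix}
\qquad (p_i \ge 0 \text{ for all } i = 1,2,\dots,n \text{ and } p_1 + p_2 + \dots + p_n = 1)
\]

the expected value of the lottery is the sum of money

\[ \$(x_1 p_1 + x_2 p_2 + \dots + x_n p_n). \]

For example, the expected value of the lottery

\[ \begin{pmatrix} \$5 & \$15 & \$25 \\ \frac{1}{5} & \frac{2}{5} & \frac{2}{5} \end{pmatrix} \]

is $[5(1/5) + 15(2/5) + 25(2/5)] = $(1 + 6 + 10) = $17 and the expected value of the lottery

\[ \begin{pmatrix} -\$9 & \$9 \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix} \]

is $3. We call lotteries whose outcomes are sums of money, money lotteries.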
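The expected-value computation can be checked mechanically. The following is a minimal Python sketch, assuming a lottery is represented as a list of (amount, probability) pairs; the function name and this representation are illustrative choices, not notation from the text. Exact fractions are used so that no rounding occurs.

```python
from fractions import Fraction

def expected_value(lottery):
    """Expected value of a money lottery given as (amount, probability) pairs."""
    probabilities = [p for _, p in lottery]
    # Probabilities must be non-negative and sum to one.
    if any(p < 0 for p in probabilities) or sum(probabilities) != 1:
        raise ValueError("probabilities must be non-negative and sum to 1")
    return sum(x * p for x, p in lottery)

# The two lotteries used as examples in the text.
first = [(5, Fraction(1, 5)), (15, Fraction(2, 5)), (25, Fraction(2, 5))]
adele = [(-9, Fraction(1, 3)), (9, Fraction(2, 3))]

print(expected_value(first))  # 17
print(expected_value(adele))  # 3
```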
Definition 3.8. A player is defined to be risk neutral if she considers a money
lottery to be just as good as its expected value. Hence a risk neutral person
ranks money lotteries according to their expected value.

For example, consider the following money lotteries:

\[
L_1 = \begin{pmatrix} \$5 & \$15 & \$25 \\ \frac{1}{5} & \frac{2}{5} & \frac{2}{5} \end{pmatrix}, \quad
L_2 = \begin{pmatrix} \$16 \\ 1 \end{pmatrix} \quad \text{and} \quad
L_3 = \begin{pmatrix} \$0 & \$32 & \$48 \\ \frac{5}{8} & \frac{1}{8} & \frac{1}{4} \end{pmatrix}.
\]

The expected value of L1 is $17 and the expected value of both L2 and L3 is $16.
Thus a risk-neutral player would have the following ranking: L1 ≻ L2 ∼ L3, that is,
she would prefer L1 to L2 and be indifferent between L2 and L3.
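A risk-neutral ranking amounts to sorting lotteries by expected value and treating ties as indifference. The sketch below illustrates this in Python; the function names and the list-of-pairs representation are hypothetical. The probabilities used for the third lottery (5/8, 1/8, 1/4) are an assumption, chosen to be consistent with its stated expected value of $16.

```python
from fractions import Fraction

def expected_value(lottery):
    """Expected value of a lottery given as (amount, probability) pairs."""
    return sum(x * p for x, p in lottery)

def risk_neutral_ranking(lotteries):
    """Group lotteries into indifference classes, most preferred class first."""
    by_value = {}
    for name, lottery in lotteries.items():
        by_value.setdefault(expected_value(lottery), []).append(name)
    return [sorted(by_value[v]) for v in sorted(by_value, reverse=True)]

lotteries = {
    "L1": [(5, Fraction(1, 5)), (15, Fraction(2, 5)), (25, Fraction(2, 5))],
    "L2": [(16, Fraction(1))],
    # Assumed probabilities for L3 (they give an expected value of 16).
    "L3": [(0, Fraction(5, 8)), (32, Fraction(1, 8)), (48, Fraction(1, 4))],
}
ranking = risk_neutral_ranking(lotteries)
print(ranking)  # [['L1'], ['L2', 'L3']]
```

Lotteries in the same inner list are ones between which a risk-neutral player is indifferent.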


For  a selfish  and  greedy  player  who  is  risk  neutral  we  can  take  the  expected  value  of 
a money  lottery  as  the  utility  of  that  lottery.  For  example,  if  we  make  the 
assumption  that,  in  the  extensive  form  of  Figure  3.17,  Adele  and  Ben  are  selfish, 
greedy  and  risk  neutral  then  we  can  associate  a strategic-form  game  to  it  as  shown  in 
Figure 3.18. Note that inside each cell we have two numbers: the first is the utility
(=  expected  value)  of  the  underlying  money  lottery  as  perceived  by  Adele  and  the 
second  number  is  the  utility  (=  expected  value)  of  the  underlying  money  lottery  as 
perceived  by  Ben. 


Figure 3.18

The strategic form of the game of Figure 3.17 when the two players are
selfish, greedy and risk neutral. The first element of Adele’s strategy is what
she says if she sees a black card and the second element what she says if she
sees a red card. The first element of Ben’s strategy is what he guesses if Adele
says “Red”, the second element what he guesses if Adele says “Black”.
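As a rough sketch of how the entries of this strategic form can be computed, the Python fragment below enumerates both players’ strategies and takes expected values over Nature’s move (black with probability 1/3, red with probability 2/3). The function and variable names are hypothetical. The ordering of Ben’s strategy follows the caption above: his guess after hearing “Red” comes first, his guess after hearing “Black” second.

```python
from fractions import Fraction
from itertools import product

PROB = {"B": Fraction(1, 3), "R": Fraction(2, 3)}  # color of the top card

def payoffs(adele, ben):
    """Expected payoffs (Adele, Ben) in dollars for risk-neutral players.

    adele = (message if the card is Black, message if the card is Red)
    ben   = (guess if Adele says "Red", guess if Adele says "Black")
    """
    adele_value = Fraction(0)
    for color, prob in PROB.items():
        message = adele[0] if color == "B" else adele[1]
        guess = ben[0] if message == "R" else ben[1]
        # Ben receives $9 from Adele on a correct guess and pays her $9 otherwise.
        adele_value += prob * (-9 if guess == color else 9)
    return adele_value, -adele_value

strategies = list(product("BR", repeat=2))  # BB, BR, RB, RR
table = {(a, b): payoffs(a, b) for a in strategies for b in strategies}

for a in strategies:
    print("".join(a), [tuple(map(int, table[(a, b)])) for b in strategies])
```

For the profile discussed earlier (Adele truthful, Ben always guessing Black) the computation gives 3 for Adele and -3 for Ben, in line with the expected value of $3 found above.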
We conclude this section with one more example.


Example  3.3.  There  are  three  unmarked,  opaque  envelopes.  One  contains 
$100,  one  contains  $200  and  the  third  contains  $300.  They  are  shuffled  well  and 
then  one  envelope  is  given  to  Player  1 and  another  is  given  to  Player  2 (the 
third  one  remains  on  the  table).  Player  1 opens  her  envelope  and  checks  its 
content  without  showing  it  to  Player  2.  Then  she  either  says  “pass”,  in  which 
case  each  player  gets  to  keep  his/her  envelope,  or  she  asks  Player  2 to  trade  his 
envelope  for  hers.  Player  2 is  not  allowed  to  see  the  content  of  his  envelope  and 
has  to  say  either  Yes  or  No.  If  he  says  No,  then  the  two  players  get  to  keep 
their  envelopes.  If  Player  2 says  Yes,  then  they  trade  envelopes  and  the  game 
ends.  Each  player  is  selfish,  greedy  and  risk  neutral. 

This  situation  is  represented  by  the  extensive-form  game  shown  in  Figure  3.19, 
where  (100,200)  means  that  Player  1 gets  the  envelope  with  $100  and  Player  2 
gets  the  envelope  with  $200,  etc.;  P stands  for  “pass”  and  T for  “suggest  a 
trade”;  Y for  “Yes”  and  N for  “No”. 


Figure 3.19

The extensive-form game of Example 3.3 (all of Nature’s choices have equal probability)


In  this  game  Player  1 has  eight  strategies.  One  possible  strategy  is:  “if  I get  $100  I 
will  pass,  if  I get  $200  I will  propose  a trade,  if  I get  $300  I will  pass”:  we  will  use  the 
shorthand PTP for this strategy. Similarly for the other strategies. Player 2 has only
two  strategies:  Yes  and  No.  The  strategic  form  associated  with  the  game  of  Figure 
3.19  is  shown  in  Table  3.20,  where  the  Nash  equilibria  are  highlighted. 


                       Player 2
                    Y             N
           PPP   200, 200     200, 200 *
           PPT   150, 250     200, 200
           PTP   200, 200     200, 200 *
Player 1   PTT   150, 250     200, 200
           TPP   250, 150     200, 200 *
           TPT   200, 200     200, 200 *
           TTP   250, 150     200, 200 *
           TTT   200, 200     200, 200 *

Table 3.20

The strategic form of the game of Figure 3.19 (* marks a Nash equilibrium)

How did we get those payoffs? Consider, for example, the first cell. Given the
strategies PPP and Y, the outcomes are ($100, $200) with probability 1/6,
($100, $300) with probability 1/6, ($200, $100) with probability 1/6, ($200, $300)
with probability 1/6, ($300, $100) with probability 1/6, ($300, $200) with
probability 1/6. Being risk neutral, Player 1 views her corresponding money
lottery as equivalent to getting its expected value $(100 + 100 + 200 + 200 + 300
+ 300)(1/6) = $200. Similarly for Player 2 and for the other cells.
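The same calculation can be repeated for every cell. The following Python sketch, with hypothetical names, enumerates the six equally likely deals of Example 3.3 and averages what each player ends up with. A strategy of Player 1 is a three-letter string such as "PTP" (what she does with $100, $200 and $300), and a strategy of Player 2 is "Y" or "N".

```python
from fractions import Fraction
from itertools import permutations, product

AMOUNTS = (100, 200, 300)
# Nature hands out two of the three envelopes: six equally likely ordered pairs
# (content of Player 1's envelope, content of Player 2's envelope).
DEALS = list(permutations(AMOUNTS, 2))

def payoffs(strategy1, strategy2):
    """Expected payoffs (Player 1, Player 2) for risk-neutral players."""
    total1 = total2 = 0
    for own, other in DEALS:
        action = strategy1[AMOUNTS.index(own)]
        if action == "T" and strategy2 == "Y":
            own, other = other, own  # the trade goes through
        total1 += own
        total2 += other
    return Fraction(total1, len(DEALS)), Fraction(total2, len(DEALS))

table = {
    ("".join(s1), s2): payoffs(s1, s2)
    for s1 in product("PT", repeat=3)
    for s2 in "YN"
}
for (s1, s2), (u1, u2) in table.items():
    print(s1, s2, int(u1), int(u2))
```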

Since  the  game  of  Figure  3.19  has  no  proper  subgames,  all  the  Nash  equilibria  are 
also  subgame-perfect  equilibria.  Are  some  of  the  Nash  equilibria  more  plausible 
than  others?  For  Player  1 all  the  strategies  are  weakly  dominated,  except  for  TPP 
and  TTP.  Elimination  of  the  weakly  dominated  strategies  leads  to  a game  where  Y 
is  strictly  dominated  for  Player  2.  Thus  one  could  argue  that  (TPP,  N)  and  (TTP, 
N)  are  the  most  plausible  equilibria;  in  both  of  them  Player  2 refuses  to  trade. 
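The dominance argument can be verified directly from the payoffs of Table 3.20. The sketch below hard-codes those payoffs and checks three things: which rows survive the deletion of weakly dominated strategies of Player 1, whether N is then strictly better than Y for Player 2, and which cells are Nash equilibria. The function names and the dictionary layout are illustrative assumptions.

```python
# Table 3.20: (Player 1 payoff, Player 2 payoff) for each row, under Y and N.
TABLE = {
    "PPP": {"Y": (200, 200), "N": (200, 200)},
    "PPT": {"Y": (150, 250), "N": (200, 200)},
    "PTP": {"Y": (200, 200), "N": (200, 200)},
    "PTT": {"Y": (150, 250), "N": (200, 200)},
    "TPP": {"Y": (250, 150), "N": (200, 200)},
    "TPT": {"Y": (200, 200), "N": (200, 200)},
    "TTP": {"Y": (250, 150), "N": (200, 200)},
    "TTT": {"Y": (200, 200), "N": (200, 200)},
}
COLUMNS = ("Y", "N")

def weakly_dominated_rows(table):
    """Rows for which some other row is never worse and sometimes better for Player 1."""
    dominated = set()
    for row in table:
        for other in table:
            diffs = [table[other][c][0] - table[row][c][0] for c in COLUMNS]
            if all(d >= 0 for d in diffs) and any(d > 0 for d in diffs):
                dominated.add(row)
    return dominated

def nash_equilibria(table):
    """Cells where neither player gains from a unilateral deviation."""
    found = []
    for row in table:
        for col in COLUMNS:
            best_row = all(table[r][col][0] <= table[row][col][0] for r in table)
            best_col = all(table[row][c][1] <= table[row][col][1] for c in COLUMNS)
            if best_row and best_col:
                found.append((row, col))
    return found

survivors = [row for row in TABLE if row not in weakly_dominated_rows(TABLE)]
# In the game restricted to the surviving rows, compare N with Y for Player 2.
n_strictly_better = all(TABLE[r]["N"][1] > TABLE[r]["Y"][1] for r in survivors)
print(survivors, n_strictly_better, nash_equilibria(TABLE))
```

The surviving rows are TPP and TTP, and against both of them Player 2 does strictly better with N, which is the reasoning behind singling out (TPP, N) and (TTP, N).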


This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this  section,  by  going  through  the  exercises  in  Section  3.E.5  of  Appendix  3.E  at  the 
end  of  this  chapter. 
Appendix 3.E: Exercises

3.E.1. Exercises for Section 3.1: Imperfect information

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this  chapter. 

Exercise  3.1.  Amy  and  Bill  simultaneously  write  a bid  on  a piece  of  paper.  The  bid 
can  only  be  either  $2  or  $3.  A referee  then  looks  at  the  bids,  announces  the  amount 
of  the  lowest  bid  (without  revealing  who  submitted  it)  and  invites  Amy  to  pass  or 
double  her  initial  bid.  If  Amy’s  final  bid  and  Bill’s  bid  are  equal  then  Bill  gets  the 
object  (and  pays  his  bid),  otherwise  Amy  gets  the  object  (and  pays  her  bid). 
Represent  this  situation  by  means  of  two  alternative  extensive  frames.  Note:  (1) 
when  there  are  simultaneous  moves  we  have  a choice  as  to  which  player  we  select  as 
moving  first:  the  important  thing  is  that  the  second  player  does  not  know  what  the 
first  player  did;  (2)  when  representing,  by  means  of  information  sets,  what  a player 
is  uncertain  about,  we  typically  assume  that  a player  is  smart  enough  to  deduce 
relevant  information,  even  if  that  information  is  not  explicitly  given  to  her. 

Exercise  3.2.  Consider  the  following  situation.  An  incumbent  monopolist  decides 
at  date  1 whether  to  build  a small  plant  or  a large  plant.  At  date  2 a potential  entrant 
observes  the  plant  built  by  the  incumbent  and  decides  whether  or  not  to  enter.  If 
she  does  not  enter  then  her  profits  are  0 while  the  incumbent’s  profits  are  $25 
million  with  a small  plant  and  $20  million  with  a large  plant.  If  the  potential  entrant 
decides  to  enter,  she  pays  a cost  of  entry  equal  to  $K  million  and  at  date  3 the  two 
firms  simultaneously  decide  whether  to  produce  high  output  or  low  output.  The 
profits of the firms are as follows (these figures do not include the cost of entry for
the  entrant;  thus  you  need  to  subtract  that  cost  for  the  entrant).  In  each  cell,  the 
first  number  is  the  profit  of  the  entrant  in  millions  of  dollars  and  the  second  is  the 
profit  of  the  incumbent. 


If incumbent has small plant:

                               Incumbent
                        low output    high output
Entrant   low output      10, 10         7, 7
          high output      7, 6          4, 3

If incumbent has large plant:

                               Incumbent
                        low output    high output
Entrant   low output      10, 7          5, 9
          high output      7, 3          4, 5

Draw  an  extensive  form  that  represents  this  situation. 
3.E.2. Exercises for Section 3.2: Strategies

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this  chapter. 

Exercise  3.3.  Write  the  strategic-form  game-frame  of  the  extensive  form  of  Exercise 
3.1  (that  is,  instead  of  writing  payoffs  in  each  cell  you  write  the  outcome).  Verify 
that  the  strategic  forms  of  the  two  possible  versions  of  the  extensive  form  are 
identical. 

Exercise  3.4.  Consider  the  extensive-form  game  of  Exercise  3.2. 

(a)  Write  down  in  words  one  of  the  strategies  of  the  potential  entrant. 

(b)  How  many  strategies  does  the  potential  entrant  have? 

(c)  Write  the  strategic-form  game  associated  with  the  extensive-form  game. 

(d)  Find  the  Nash  equilibria  for  the  case  where  K = 2. 


3.E.3.  Exercises  for  Section  3.3:  Subgames 

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this  chapter. 

Exercise  3.5.  How  many  proper  subgames  does  the  extensive  form  of  Figure  3.3 
have? 

Exercise  3.6.  How  many  proper  subgames  does  the  extensive  game  of  Figure  3.5 
have? 

Exercise  3.7.  (a)  How  many  proper  subgames  does  the  extensive  game  of  Figure  2.3 
(Chapter  2)  have?  (b)  How  many  of  those  proper  subgames  are  minimal? 
3.E.4. Exercises for Section 3.4: Subgame-perfect equilibrium

Exercise  3.8.  Find  the  Nash  equilibria  and  the  subgame-perfect  equilibria  of  the 
following  game. 


[Figure: the extensive-form game of Exercise 3.8]


Exercise  3.9.  Find  the  Nash  equilibria  and  the  subgame-perfect  equilibria  of  the 
following  game. 


[Figure: the extensive-form game of Exercise 3.9]
Exercise 3.10. Find the subgame-perfect equilibria of the following extensive-form
game.

[Figure: the extensive-form game of Exercise 3.10]


3.E.5.  Exercises  for  Section  3.5:  Games  with  chance  moves 

Exercise  3.11.  Modify  the  game  of  Example  3.3  as  follows:  Player  2 is  allowed  to 
secretly  check  the  content  of  his  envelope  before  he  decides  whether  or  not  to 
accept Player 1’s proposal.

(a)  Represent  this  situation  as  an  extensive-form  game. 

(b)  List  all  the  strategies  of  Player  1 and  all  the  strategies  of  Player  2. 
Exercise 3.12. Three players, Avinash, Brian and John, play the following game.
Two  cards,  one  red  and  the  other  black,  are  shuffled  well  and  put  face  down  on  the 
table.  Brian  picks  the  top  card,  looks  at  it  without  showing  it  to  the  other  players 
(Avinash  and  John)  and  puts  it  back  face  down  on  the  table.  Then  Brian  whispers 
either  “Black”  or  “Red”  in  Avinash’s  ear,  making  sure  that  John  doesn’t  hear. 
Avinash  then  tells  John  either  “Black”  or  “Red”.  Finally  John  announces  either 
“Black”  or  “Red”  and  this  exciting  game  ends.  The  payoffs  are  as  follows:  if  John’s 
final  announcement  matches  the  true  color  of  the  card  Brian  looked  at,  then  Brian 
and  Avinash  give  $2  each  to  John.  In  every  other  case  John  gives  $2  each  to  Brian 
and  Avinash. 

(a)  Represent  this  situation  as  an  extensive-form  game. 

(b)  Write  the  corresponding  strategic  form  (or  normal  form)  assuming  that  the 
players  are  selfish,  greedy  and  risk  neutral. 

Exercise  3.13.  Consider  the  following  highly  simplified  version  of  Poker.  There  are 
three  cards,  marked  A,  B and  C.  A beats  B and  C,  B beats  C.  There  are  two 
players,  Yvonne  and  Zoe.  Each  player  contributes  $1  to  the  pot  before  the  game 
starts.  The  cards  are  then  shuffled  and  the  top  card  is  given,  face  down,  to  Yvonne 
and  the  second  card  (face  down)  to  Zoe.  Each  player  looks  at,  and  only  at,  her  own 
card:  she  does  not  see  the  card  of  the  other  player  nor  the  remaining  card. 
Yvonne,  the  first  player,  may  pass,  or  bet  $1.  If  she  passes,  the  game  ends,  the  cards 
are  turned  and  the  pot  goes  to  the  high-card  holder  (recall  that  A beats  B and  C,  B 
beats  C).  If  Yvonne  bets,  then  Zoe  can  fold,  in  which  case  the  game  ends  and  the  pot 
goes to Yvonne, or see by betting $1. If Zoe sees, the game ends, the cards are
turned  and  the  pot  goes  to  the  high-card  holder.  Both  players  are  selfish,  greedy  and 
risk  neutral. 

(a)  Draw  the  extensive-form  game. 

(b)  How  many  strategies  does  Yvonne  have? 

(c)  How  many  strategies  does  Zoe  have? 

(d)  Consider  the  following  strategies.  For  Yvonne:  If  A pass,  if  B pass,  if  C bet. 
For  Zoe:  if  Yvonne  bets  and  I get  an  A I will  fold,  if  Yvonne  bets  and  I get  a 
B I will  fold,  if  Yvonne  bets  and  I get  a C I will  fold.  Calculate  the 
corresponding  payoffs. 

(e)  Redo  the  same  with  the  following  strategies.  For  Yvonne:  If  A pass,  if  B 
pass,  if  C bet.  For  Zoe:  see  always  (that  is,  no  matter  what  card  she  gets). 

(f)  Now  that  you  have  understood  how  to  calculate  the  payoffs,  represent  the 
entire  game  as  a normal  form  game,  assigning  the  rows  to  Yvonne  and  the 
columns  to  Zoe.  [This  might  take  you  the  entire  night,  but  at  your  age 
sleep  is  not  that  important  and  also  it  will  keep  you  out  of  trouble.] 

(g)  What  strategies  of  Yvonne  are  weakly  dominated?  What  strategies  of  Zoe 
are  weakly  dominated? 

(h)  What  do  you  get  when  you  apply  the  procedure  of  iterative  elimination  of 
weakly  dominated  strategies? 
Exercise 3.14: Challenging Question. In an attempt to reduce the deficit, the
government of Italy has decided to sell a 14th-century palace near Rome. The palace is in
disrepair and is not generating any revenue for the government. From now on we’ll call the
government Player G. A Chinese millionaire has offered to purchase the palace for $p.
Alternatively, Player G can organize an auction among n interested parties (n ≥ 2). The
participants in the auction (we’ll call them players) have been randomly assigned labels
1,2,...,n. Player i is willing to pay up to $p_i for the palace, where p_i is a positive integer. For
the auction assume the following: (1) it is a simultaneous, sealed-bid second-price auction, (2)
bids must be non-negative integers, (3) each player only cares about his own wealth, (4) the
tie-breaking rule for the auction is that the palace is given to that player who has the lowest
index (e.g. if the highest bid was submitted by Players 3, 5 and 12 then the palace is given to
Player 3). All of the above is commonly known among everybody involved, as is the fact
that for every i, j ∈ {1,...,n} with i ≠ j, p_i ≠ p_j. We shall consider four different
scenarios. In all scenarios you can assume that the p_i’s are common knowledge.

Scenario  1.  Player  G first  decides  whether  to  sell  the  palace  to  the  Chinese  millionaire 
or  make  a public  and  irrevocable  decision  to  auction  it. 

(a)  Draw  the  extensive  form  of  this  game  for  the  case  where  n = 2 and  the  only 
possible  bids  are  $1  and  $2.  [List  payoffs  in  the  following  order:  first  G then  1 then 
2.] 

(b) For the general case where n ≥ 2 and every positive integer is a possible bid, find a
pure-strategy  subgame-perfect  equilibrium  of  this  game.  What  are  the  players’ 
payoffs  at  the  equilibrium? 

Scenario 2. Here we assume that n = 2, and p_1 > p_2 + 1 > 2. First Player G decides
whether to sell the palace to the Chinese or make a public and irrevocable decision to
auction it. In the latter case he first asks Player 2 to publicly announce whether or not
he is going to participate in the auction. If Player 2 says Yes then he has to pay $1 to
Player G as a participation fee, which is non-refundable. If he says No then he is out
of the game. After player 2 has made his announcement (and paid his fee if he decided
to  participate)  Player  1 is  asked  to  make  the  same  decision  (participate  and  pay  a non- 
refundable  fee  of  $1  to  Player  G or  stay  out);  Player  1 knows  player  2’s  decision  when 
he  makes  his  own  decision.  After  both  players  have  made  their  decisions,  player  G 
proceeds  as  follows:  (1)  if  both  1 and  2 said  Yes  then  he  makes  them  play  a 
simultaneous  second-price  auction,  (2)  if  only  one  player  said  Yes  then  he  is  asked  to 
put  an  amount  $x  of  his  choice  in  an  envelope  (where  x is  a positive  integer)  and  give 
it  to  Player  G in  exchange  for  the  palace,  (3)  if  both  1 and  2 said  No  then  G is  no 
longer  bound  by  his  commitment  to  auction  the  palace  and  he  sells  it  to  the  Chinese. 

(c) Draw the extensive form of this game for the case where the only possible bids are
$1 and $2 and also x ∈ {1,2}. [List payoffs in the following order: first G then 1
then 2.]

(d)  For  the  general  case  where  all  possible  bids  are  allowed  (subject  to  being  positive 
integers)  and  x can  be  any  positive  integer,  find  a pure-strategy  subgame-perfect 
equilibrium  of  this  game.  What  are  the  players’  payoffs  at  the  equilibrium? 
Scenario 3. Same as Scenario 2; the only difference is that if both Players 1 and 2
decide  to  participate  in  the  auction  then  Player  G gives  to  the  loser  the  fraction  a (with 
0 < a < 1)  of  the  amount  paid  by  the  winner  in  the  auction  (note  that  player  G still 
keeps  100%  of  the  participation  fees).  This  is  publicly  announced  at  the  beginning  and 
is  an  irrevocable  commitment. 

(e)  For  the  general  case  where  all  possible  bids  are  allowed  (subject  to  being  positive 
integers) find a subgame-perfect equilibrium of this game. What are the players’
payoffs at the equilibrium?

Scenario  4.  Player  G tells  the  Chinese  millionaire  the  following: 

“First  you  (=  the  Chinese)  say  Yes  or  No;  if  you  say  No  I will  sell  you  the 
palace  for  the  price  that  you  offered  me,  namely  $100  (that  is,  we  now 
assume  that  p = 100);  if  you  say  Yes  then  we  play  the  following  perfect 
information  game.  I start  by  choosing  a number  from  the  set  {1,2,3},  then 
you  (=  the  Chinese)  choose  a number  from  this  set,  then  I choose  again, 
followed  by  you,  etc.  The  first  player  who  brings  the  cumulative  sum  of  all 
the  numbers  chosen  (up  to  and  including  the  last  one)  to  40  wins.  If  you 
win  I will  sell  you  the  palace  for  $50,  while  if  I win  I will  sell  you  the 
palace  for  $200.”  Thus  there  is  no  auction  in  this  scenario.  Assume  that 
the  Chinese  would  actually  be  willing  to  pay  up  to  $300  for  the  palace. 

(f)  Find  a pure-strategy  subgame-perfect  equilibrium  of  this  game. 
Appendix 3.S: Solutions to exercises


Exercise  3.1.  One  possible  extensive  frame  is  as  follows,  where  Amy  moves 
first.  Note  that  we  have  only  one  non-trivial  information  set  for  Amy,  while 
each  of  the  other  three  consists  of  a single  node.  The  reason  is  as  follows:  if 
Amy  initially  bids  $3  and  Bill  bids  $2  then  the  referee  announces  “the  lowest 
bid  was  $2”;  this  announcement  does  not  directly  reveal  to  Amy  that  Bill’s  bid 
was  $2,  but  she  can  figure  it  out  from  her  knowledge  that  her  own  bid  was  $3; 
similarly,  if  the  initial  two  bids  are  both  $3  then  the  referee  announces  “the 
lowest  bid  was  $3”,  in  which  case  Amy  is  able  to  figure  out  that  Bill’s  bid  was 
also  $3.  If  we  included  those  two  nodes  in  the  same  information  set  for  Amy, 
we  would  not  show  much  faith  in  Amy’s  reasoning  ability! 


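[Figure: extensive frame in which Amy moves first. The terminal outcomes, from left
to right, are: Bill wins and pays $2; Amy wins and pays $4; Bill wins and pays $3;
Amy wins and pays $4; Amy wins and pays $3; Amy wins and pays $6; Bill wins and
pays $3; Amy wins and pays $6.]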
Another possible extensive frame is as follows, where Bill moves first.

[Figure: extensive frame in which Bill moves first]


Exercise 3.2. The extensive form is as follows.

[Figure: the extensive form for Exercise 3.2]

Exercise 3.3. Amy’s strategy (x,y,w,z) means: at the beginning I bid $x, at the
non-trivial information set on the left I choose y, at the singleton node in the
middle I choose w and at the singleton node on the right I choose z. The
numbers are bid amounts and P stands for “Pass”.

                                    BILL
                      bid $2                 bid $3
AMY   2, P, P, P      Bill wins, pays 2      Bill wins, pays 3
      2, P, P, 6      Bill wins, pays 2      Bill wins, pays 3
      2, P, 6, P      Bill wins, pays 2      Bill wins, pays 3
      2, P, 6, 6      Bill wins, pays 2      Bill wins, pays 3
      2, 4, P, P      Amy wins, pays 4       Amy wins, pays 4
      2, 4, P, 6      Amy wins, pays 4       Amy wins, pays 4
      2, 4, 6, P      Amy wins, pays 4       Amy wins, pays 4
      2, 4, 6, 6      Amy wins, pays 4       Amy wins, pays 4
      3, P, P, P      Amy wins, pays 3       Bill wins, pays 3
      3, P, P, 6      Amy wins, pays 3       Amy wins, pays 6
      3, P, 6, P      Amy wins, pays 6       Bill wins, pays 3
      3, P, 6, 6      Amy wins, pays 6       Amy wins, pays 6
      3, 4, P, P      Amy wins, pays 3       Bill wins, pays 3
      3, 4, P, 6      Amy wins, pays 3       Amy wins, pays 6
      3, 4, 6, P      Amy wins, pays 6       Bill wins, pays 3
      3, 4, 6, 6      Amy wins, pays 6       Amy wins, pays 6

Exercise 3.4. (a) The potential entrant has four information sets, hence a strategy has to say what she would do in each of the four situations. A possible strategy is: “if the incumbent chooses a small plant I stay out, if the incumbent chooses a large plant I enter, if small plant and I entered then I choose low output, if large plant and I entered then I choose high output”.

(b) The potential entrant has 2^4 = 16 strategies.

(c) The strategic form is as follows.

Rows are the ENTRANT's strategies, columns are the INCUMBENT's strategies. In each cell the first number is the entrant's payoff and the second is the incumbent's payoff.

                                        INCUMBENT
        SLL        SLH        SHL        SHH        LLL        LLH        LHL        LHH
OOLL    0, 25      0, 25      0, 25      0, 25      0, 20      0, 20      0, 20      0, 20
OOLH    0, 25      0, 25      0, 25      0, 25      0, 20      0, 20      0, 20      0, 20
OOHL    0, 25      0, 25      0, 25      0, 25      0, 20      0, 20      0, 20      0, 20
OOHH    0, 25      0, 25      0, 25      0, 25      0, 20      0, 20      0, 20      0, 20
OILL    0, 25      0, 25      0, 25      0, 25      10-K, 7    5-K, 9     10-K, 7    5-K, 9
OILH    0, 25      0, 25      0, 25      0, 25      7-K, 3     4-K, 5     7-K, 3     4-K, 5
OIHL    0, 25      0, 25      0, 25      0, 25      10-K, 7    5-K, 9     10-K, 7    5-K, 9
OIHH    0, 25      0, 25      0, 25      0, 25      7-K, 3     4-K, 5     7-K, 3     4-K, 5
IOLL    10-K, 10   10-K, 10   7-K, 7     7-K, 7     0, 20      0, 20      0, 20      0, 20
IOLH    10-K, 10   10-K, 10   7-K, 7     7-K, 7     0, 20      0, 20      0, 20      0, 20
IOHL    7-K, 6     7-K, 6     4-K, 3     4-K, 3     0, 20      0, 20      0, 20      0, 20
IOHH    7-K, 6     7-K, 6     4-K, 3     4-K, 3     0, 20      0, 20      0, 20      0, 20
IILL    10-K, 10   10-K, 10   7-K, 7     7-K, 7     10-K, 7    5-K, 9     10-K, 7    5-K, 9
IILH    10-K, 10   10-K, 10   7-K, 7     7-K, 7     7-K, 3     4-K, 5     7-K, 3     4-K, 5
IIHL    7-K, 6     7-K, 6     4-K, 3     4-K, 3     10-K, 7    5-K, 9     10-K, 7    5-K, 9
IIHH    7-K, 6     7-K, 6     4-K, 3     4-K, 3     7-K, 3     4-K, 5     7-K, 3     4-K, 5

(d)  For  the  case  where  K =2  the  Nash  equilibria  are  highlighted  in  the  above 
table. 
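
The equilibria of part (d) can also be recomputed mechanically from the payoffs. The following is a minimal sketch, assuming the payoffs are as read off the table in part (c) (first number the entrant's payoff, second the incumbent's) and using the same strategy labels. The names `ENTRY_GROSS`, `NO_ENTRY`, `payoffs` and `pure_nash` are illustrative and do not come from the text.

```python
from itertools import product

# Payoffs after entry, keyed by (plant size, entrant's output, incumbent's output).
# The entrant's number is gross of the entry cost K (assumption: read off the table).
ENTRY_GROSS = {
    ("S", "L", "L"): (10, 10), ("S", "L", "H"): (7, 7),
    ("S", "H", "L"): (7, 6),   ("S", "H", "H"): (4, 3),
    ("L", "L", "L"): (10, 7),  ("L", "L", "H"): (5, 9),
    ("L", "H", "L"): (7, 3),   ("L", "H", "H"): (4, 5),
}
NO_ENTRY = {"S": (0, 25), "L": (0, 20)}  # entrant stays out

# Entrant: (Out/In after small, Out/In after large, output if small, output if large).
ENTRANT = ["".join(s) for s in product("OI", "OI", "LH", "LH")]
# Incumbent: (plant size, output if small plant, output if large plant).
INCUMBENT = ["".join(s) for s in product("SL", "LH", "LH")]


def payoffs(e, i, K):
    """Return (entrant payoff, incumbent payoff) for the strategy pair (e, i)."""
    plant = i[0]
    k = 0 if plant == "S" else 1          # which contingency is reached
    if e[k] == "O":
        return NO_ENTRY[plant]
    gross, incumbent = ENTRY_GROSS[(plant, e[2 + k], i[1 + k])]
    return (gross - K, incumbent)


def pure_nash(K):
    """All pure-strategy profiles from which no player gains by deviating."""
    equilibria = []
    for e, i in product(ENTRANT, INCUMBENT):
        ue, ui = payoffs(e, i, K)
        entrant_ok = all(payoffs(e2, i, K)[0] <= ue for e2 in ENTRANT)
        incumbent_ok = all(payoffs(e, i2, K)[1] <= ui for i2 in INCUMBENT)
        if entrant_ok and incumbent_ok:
            equilibria.append((e, i))
    return equilibria


for profile in pure_nash(K=2):
    print(profile)
```

Calling `pure_nash` with another value of K shows how the set of equilibria depends on the entry cost.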


Exercise  3.5.  None. 


Exercise 3.6. None.

Exercise 3.7. The game under consideration is the following:

[Figure: game tree not reproduced.]


(a)  There  are  three  proper  subgames:  one  starting  from  node  x,  one  starting 
from  node  y and  one  starting  at  the  node  of  Player  3. 

(b)  Two:  the  one  starting  from  node  x and  the  one  starting  at  the  decision  node 
of  Player  3.  In  a perfect-information  game  a minimal  proper  subgame  is  one 
that  starts  at  a decision  node  followed  only  by  terminal  nodes. 

Exercise 3.8. The Nash equilibria are (B,C,E) and (B,D,E). There are no subgame-perfect equilibria since the proper subgame has no Nash equilibria.

Exercise  3.9.  The  strategic  form  is  as  follows: 

Player 3 chooses E:

                             Player 2
                 GC         GD         HC         HD
Player 1   A     2, 2, 4    2, 2, 4    2, 0, 1    2, 0, 1
           B     1, 4, 3    4, 3, 2    1, 4, 3    4, 3, 2

Player 3 chooses F:

                             Player 2
                 GC         GD         HC         HD
Player 1   A     4, 5, 3    4, 5, 3    0, 6, 0    0, 6, 0
           B     1, 4, 3    4, 3, 2    1, 4, 3    4, 3, 2


The Nash equilibria are: (A,(G,C),E) and (B,(H,C),F).

The extensive-form game has two proper subgames. The one on the left has a unique Nash equilibrium, (G,E), and the one on the right has a unique Nash equilibrium, C. Hence the game reduces to:

[Figure: reduced game in which Player 1 chooses between A and B; not reproduced.]

where A is the unique optimal choice. Hence there is only one subgame-perfect equilibrium, namely (A,(G,C),E).

Exercise  3.10.  Consider  first  the  subgame  that  starts  at  Player  2’s  decision  node 
coming  after  choice  A of  Player  1 (only  the  payoff  of  Players  2 and  3 are  shown 
in  the  following  table): 


                        Player 3
                  F        G        H
             C    0, 0     2, 0     1, 1
Player 2     D    1, 0     1, 2     2, 1
             E    0, 1     0, 1     3, 2

The  unique  Nash  equilibrium  is  (E,  H). 


Now  consider  the  subgame  that  starts  at  Player  2’s  decision  node  coming  after 
choice  B of  Player  1 


                     Player 3
                  P        R
Player 2     L    2, 1     1, 0
             M    0, 0     1, 1

There  are  two  Nash  equilibria:  (L,  P)  and  (M,  R). 

Thus  there  are  two  subgame-perfect  equilibria  of  the  entire  game: 

(1) (A,(E,L),(H,P)). Player 1’s strategy: A, Player 2’s strategy: E if A and L if B, Player 3’s strategy: H if A and P if B.

(2) (B,(E,M),(H,R)). Player 1’s strategy: B, Player 2’s strategy: E if A and M if B, Player 3’s strategy: H if A and R if B.


Exercise  3.11.  (a)  The  game  is  as  follows. 


NATURE  (all  choices  have  an  equal 


(b) Player 1’s strategies are the same as in Example 3.3. Player 2 now has 8
strategies. Each strategy has to specify how to reply to Player 1’s proposal
depending  on  the  sum  he  (Player  2)  has.  Thus  one  possible  strategy  is:  if  I 
have  $100  I say  No,  if  I have  $200  I say  Yes  and  if  I have  $300  I say  No. 




Exercise  3.12.  (a)  The  game  is  as  follows. 


-2,-2, 4 

2, 2, -4 
-2, -2, 4 

2, 2, -4 


2, 2, -4 

-2, -2, 4 
2, 2, -4 

-2, -2, 4 


(b)  Each  player  has  two  information  sets,  two  choices  at  each  information  set, 
hence  four  strategies.  The  strategic  form  is  given  by  the  following  set  of  tables. 

In each table the rows are Brian's strategies and the columns are Avinash's strategies.

John chooses: if B, B and if R, R

                                             Avinash
                           if B, B;      if B, B;      if B, R;      if B, R;
                           if R, R       if R, B       if R, R       if R, B
        if B, B; if R, R   -2, -2, 4     0, 0, 0       0, 0, 0       2, 2, -4
Brian   if B, B; if R, B   0, 0, 0       0, 0, 0       0, 0, 0       0, 0, 0
        if B, R; if R, R   0, 0, 0       0, 0, 0       0, 0, 0       0, 0, 0
        if B, R; if R, B   2, 2, -4      0, 0, 0       0, 0, 0       -2, -2, 4

John chooses: if B, B and if R, B

                                             Avinash
                           if B, B;      if B, B;      if B, R;      if B, R;
                           if R, R       if R, B       if R, R       if R, B
        if B, B; if R, R   0, 0, 0       0, 0, 0       0, 0, 0       0, 0, 0
Brian   if B, B; if R, B   0, 0, 0       0, 0, 0       0, 0, 0       0, 0, 0
        if B, R; if R, R   0, 0, 0       0, 0, 0       0, 0, 0       0, 0, 0
        if B, R; if R, B   0, 0, 0       0, 0, 0       0, 0, 0       0, 0, 0

John chooses: if B, R and if R, R

                                             Avinash
                           if B, B;      if B, B;      if B, R;      if B, R;
                           if R, R       if R, B       if R, R       if R, B
        if B, B; if R, R   0, 0, 0       0, 0, 0       0, 0, 0       0, 0, 0
Brian   if B, B; if R, B   0, 0, 0       0, 0, 0       0, 0, 0       0, 0, 0
        if B, R; if R, R   0, 0, 0       0, 0, 0       0, 0, 0       0, 0, 0
        if B, R; if R, B   0, 0, 0       0, 0, 0       0, 0, 0       0, 0, 0

John chooses: if B, R and if R, B

                                             Avinash
                           if B, B;      if B, B;      if B, R;      if B, R;
                           if R, R       if R, B       if R, R       if R, B
        if B, B; if R, R   2, 2, -4      0, 0, 0       0, 0, 0       -2, -2, 4
Brian   if B, B; if R, B   0, 0, 0       0, 0, 0       0, 0, 0       0, 0, 0
        if B, R; if R, R   0, 0, 0       0, 0, 0       0, 0, 0       0, 0, 0
        if B, R; if R, B   -2, -2, 4     0, 0, 0       0, 0, 0       2, 2, -4

How can we fill in the payoffs without spending more than 24 hours on this problem? There is a quick way of doing it. First of all, when John’s strategy is to guess Black, no matter what Avinash says, then he has a 50% chance of being right and a 50% chance of being wrong. Thus his expected payoff is ½(4) + ½(−4) = 0 and, similarly, the expected payoff of each of the other two players is ½(−2) + ½(2) = 0. This explains why the second table is filled with the same payoff vector, namely (0,0,0). The same reasoning applies to the case where John’s strategy is to guess Red, no matter what Avinash says (leading to the third table, filled with the same payoff vector (0,0,0)).

For the remaining strategies of John’s, one can proceed as follows. Start with the two colors, B and R. Under B write T (for true) if Brian’s strategy says “if B then B” and write F (for false) if Brian’s strategy says “if B then R”; similarly, under R write T (for true) if Brian’s strategy says “if R then R” and write F (for false) if Brian’s strategy says “if R then B”. In the next row, in the B column rewrite what is in the previous row if Avinash’s strategy says “if B then B” and change a T into an F or an F into a T if Avinash’s strategy says “if B then R”. Similarly for the R column. Now repeat the same for John (in the B column a T remains a T and an F remains an F if John’s strategy is “if B then B”, while a T is changed into an F and an F is changed into a T if John’s strategy is “if B then R”). Now in each column the payoffs are (−2, −2, 4) if the last row has a T and (2, 2, −4) if the last row has an F. The payoffs are then given by ½ the payoff in the left column plus ½ the payoff in the right column. For example, for the cell in the second row, third column of the third table we have the following.

                                                B              R
Brian’s strategy: if B, B and if R, B           T              F
Avinash’s strategy: if B, R and if R, R         F              T
John’s strategy: if B, R and if R, R            F              T
Payoffs:                                    (2, 2, −4)     (−2, −2, 4)

Expected payoffs: ½(2, 2, −4) + ½(−2, −2, 4) = (0, 0, 0)
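
The same cells can be computed by following the message along the chain. The sketch below assumes, as in the worked example above, that Brian sees the color, Avinash hears what Brian says, John hears what Avinash says, and the payoffs are (−2, −2, 4) when John's guess is correct and (2, 2, −4) otherwise, with the two colors equally likely. Strategies are encoded as dictionaries from what is seen or heard to what is said; this encoding and the function name `expected_payoffs` are illustrative choices, not the text's notation.

```python
from fractions import Fraction

COLORS = ("B", "R")


def expected_payoffs(brian, avinash, john):
    """Expected payoff vector (Brian, Avinash, John) for one strategy profile.

    Each strategy maps the color seen (Brian) or the color heard
    (Avinash, John) to the color announced or guessed.
    """
    total = [Fraction(0), Fraction(0), Fraction(0)]
    for color in COLORS:                      # each color has probability 1/2
        guess = john[avinash[brian[color]]]   # pass the message down the chain
        payoff = (-2, -2, 4) if guess == color else (2, 2, -4)
        total = [t + Fraction(1, 2) * p for t, p in zip(total, payoff)]
    return tuple(total)


# The cell worked out above: second row, third column of the third table.
brian = {"B": "B", "R": "B"}      # if B, B and if R, B
avinash = {"B": "R", "R": "R"}    # if B, R and if R, R
john = {"B": "R", "R": "R"}       # if B, R and if R, R
print(expected_payoffs(brian, avinash, john))
```

Looping over all four strategies of each player reproduces the sixteen cells of each of the four tables.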

Exercise  3.13.  (a)  The  extensive-form  representation  of  the  simplified  poker 
game  is  as  follows  (the  top  number  is  Yvonne’s  net  take  in  dollars  and  the 
bottom  number  is  Zoe’s  net  take). 


[Figure: extensive form not reproduced. Nature moves first, choosing the order of the three cards (BAC, ACB, BCA, ...), each with probability 1/6.]

(b) and (c) Each player has eight strategies (three information sets, two choices at each information set, thus 2 x 2 x 2 = 8 possible strategies).

(d) Yvonne uses the strategy “If A pass, if B pass, if C bet” and Zoe uses the strategy “If A fold, if B fold, if C fold”. We calculate the expected net payoff for Yvonne. Zoe’s expected net payoff is the negative of that.

Possible cases
  Top card is:     A       A       B       B       C             C
  Second is:       B       C       A       C       A             B
  Y’s payoff:      1       1       −1      1       1             1
  Explain:         pass    pass    pass    pass    bet + fold    bet + fold

Sum: 4.  Probability of each: 1/6.  Expected payoff: 4/6.


(e) Yvonne uses the strategy “If A pass, if B pass, if C bet” and Zoe uses the strategy “see with any card”. Once again, we calculate the expected net payoff for Yvonne. Zoe’s expected net payoff is the negative of that.

Possible cases
  Top card is:     A       A       B       B       C            C
  Second is:       B       C       A       C       A            B
  Y’s payoff:      1       1       −1      1       −2           −2
  Explain:         pass    pass    pass    pass    bet + see    bet + see

Sum: −2.  Probability of each: 1/6.  Expected payoff: −2/6.

(f) The strategic form is as follows.


ZOE's strategies (columns):
  1. If A fold, if B fold, if C fold
  2. If A see, if B see, if C see
  3. If A see, if B fold, if C fold
  4. If A fold, if B see, if C fold
  5. If A fold, if B fold, if C see
  6. If A see, if B see, if C fold
  7. If A see, if B fold, if C see
  8. If A fold, if B see, if C see

YVONNE's strategies (rows):
  1. If A pass, if B pass, if C pass
  2. If A bet, if B bet, if C bet
  3. If A bet, if B pass, if C pass
  4. If A pass, if B bet, if C pass
  5. If A pass, if B pass, if C bet
  6. If A bet, if B bet, if C pass
  7. If A bet, if B pass, if C bet
  8. If A pass, if B or C bet

In each cell the first number is Yvonne's expected payoff and the second is Zoe's.

       1            2            3            4            5            6            7            8
1      0, 0         0, 0         0, 0         0, 0         0, 0         0, 0         0, 0         0, 0
2      1, -1        0, 0         0, 0         4/6, -4/6    8/6, -8/6    -2/6, 2/6    2/6, -2/6    1, -1
3      0, 0         2/6, -2/6    0, 0         1/6, -1/6    1/6, -1/6    1/6, -1/6    1/6, -1/6    2/6, -2/6
4      2/6, -2/6    0, 0         -1/6, 1/6    2/6, -2/6    3/6, -3/6    -1/6, 1/6    0, 0         3/6, -3/6
5      4/6, -4/6    -2/6, 2/6    1/6, -1/6    1/6, -1/6    4/6, -4/6    -2/6, 2/6    1/6, -1/6    1/6, -1/6
6      2/6, -2/6    2/6, -2/6    -1/6, 1/6    3/6, -3/6    4/6, -4/6    0, 0         1/6, -1/6    5/6, -5/6
7      4/6, -4/6    0, 0         1/6, -1/6    2/6, -2/6    5/6, -5/6    -1/6, 1/6    2/6, -2/6    3/6, -3/6
8      1, -1        -2/6, 2/6    0, 0         3/6, -3/6    7/6, -7/6    -3/6, 3/6    1/6, -1/6    4/6, -4/6
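
Each cell of the strategic form is an average over the six equally likely deals, so the calculations of parts (d) and (e) can be automated. The following is a minimal sketch under the payoff rule implied by those two parts: A beats B beats C; if Yvonne passes the higher card wins $1; if she bets and Zoe folds Yvonne wins $1; if she bets and Zoe sees the higher card wins $2. The function names and the dictionary encoding of strategies are illustrative assumptions.

```python
from fractions import Fraction
from itertools import permutations

RANK = {"A": 3, "B": 2, "C": 1}  # A is the highest card


def yvonne_payoff(top, second, yvonne, zoe):
    """Yvonne's net take when she holds `top` and Zoe holds `second`."""
    sign = 1 if RANK[top] > RANK[second] else -1
    if yvonne[top] == "pass":
        return sign            # showdown for $1
    if zoe[second] == "fold":
        return 1               # Zoe gives up her $1
    return 2 * sign            # showdown for $2


def expected_payoff(yvonne, zoe):
    """Yvonne's expected net payoff; Zoe's is the negative of it."""
    deals = list(permutations("ABC", 2))   # (top card, second card), 6 deals
    return sum(Fraction(yvonne_payoff(t, s, yvonne, zoe), len(deals))
               for t, s in deals)


yvonne = {"A": "pass", "B": "pass", "C": "bet"}
always_fold = {"A": "fold", "B": "fold", "C": "fold"}
always_see = {"A": "see", "B": "see", "C": "see"}

print(expected_payoff(yvonne, always_fold))  # part (d)
print(expected_payoff(yvonne, always_see))   # part (e)
```

Note that `Fraction` prints results in lowest terms, so 4/6 appears as 2/3.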

(g) Let ≽ denote weak dominance, that is, a ≽ b means that a weakly dominates b.

FOR YVONNE (row player): 3rd row ≽ 1st row, 6th ≽ 4th, 7th ≽ 4th, 7th ≽ 5th, 2nd ≽ 8th.

FOR ZOE (column player): 3rd col ≽ 1st col, 3rd ≽ 4th, 3rd ≽ 5th, 3rd ≽ 7th, 3rd ≽ 8th, 2nd ≽ 8th, 4th ≽ 5th, 4th ≽ 8th, 6th ≽ 2nd, 6th ≽ 4th, 6th ≽ 5th, 6th ≽ 7th, 6th ≽ 8th, 7th ≽ 4th, 7th ≽ 5th, 7th ≽ 8th.

(h) Eliminating rows 1, 4, 5 and 8 and all columns except 3 and 6 we are left with:

                                               Zoe
                                     See only if A     See with A or B
          Bet always                 0, 0              -2/6, 2/6
Yvonne    If A, Bet, otherwise pass  0, 0              1/6, -1/6
          If A or B, Bet, if C Pass  -1/6, 1/6         0, 0
          If A or C, Bet, if B Pass  1/6, -1/6         -1/6, 1/6

In the above table, the second row dominates the first and the third. Eliminating them we have the following, which is a remarkable simplification of the original strategic form:

                                               Zoe
                                     See only if A     See with A or B
Yvonne    If A, Bet, otherwise pass  0, 0              1/6, -1/6
          If A or C, Bet, if B Pass  1/6, -1/6         -1/6, 1/6

Exercise 3.14. (a) The extensive form is as follows:


[Figure: extensive form in which Player G either sells to the Chinese or holds the auction; not reproduced.]

(b) In the auction subgame for every player it is a weakly dominant strategy to bid his own value. Let \(p_j = \max_i \{p_i\}\) be the highest value and \(p_k = \max_{i \neq j} \{p_i\}\) be the second highest value. Then the auction, if it takes place, will be won by Player j and he will pay \(p_k\). Hence there are three cases.
Case 1: \(p > p_k\). In this case Player G will sell to the Chinese (and the strategy of Player i in the subgame is to bid \(p_i\)), G’s payoff is p and the payoff of Player i is 0.
Case 2: \(p < p_k\). In this case Player G announces the auction, the strategy of Player i in the subgame is to bid \(p_i\), the winner is Player j and he pays \(p_k\), so that the payoff of G is \(p_k\), the payoff of Player j is \(p_j - p_k\) and the payoff of every other player is 0.
Case 3: \(p = p_k\). In this case there are two subgame-perfect equilibria: one as in Case 1 and the other as in Case 2, and G is indifferent between the two.

(c) The game is as follows:


[Figure: extensive form not reproduced.]

(d) In the simultaneous subgame after both players have said Yes, the participation fee paid is a sunk cost and for every player bidding the true value is a weakly dominant strategy. Thus the outcome there is as follows: Player 1 bids \(p_1\), gets the palace by paying \(p_2\), G’s payoff is \((p_2 + 2)\), Player 1’s payoff is \((p_1 - p_2 - 1)\) and Player 2’s payoff is −1. In the subgames where one player said No and the other said Yes the optimal choice is obviously x = 1, with payoffs of 2 for Player G, 0 for the player who said No and \(p_i - 2\) for the player who said Yes. Thus the game reduces to the one shown in Figure A below:

By assumption, \(p_1 > p_2 + 1 > 2\), so that \(p_1 - p_2 - 1 > 0\) and \(p_1 - 2 > 0\). Thus at the bottom node and at the left node Player 1 prefers Yes to No. Thus the game reduces to the one shown in Figure B.

Hence Player 2 will say No. Thus the subgame-perfect equilibrium is as follows:

(1) if p > 2 then Player G will sell to the Chinese (and the choices off the equilibrium path are as explained above) and the payoffs are (p, 0, 0);

(2) if p < 2 then G chooses to auction, 2 says No, 1 says Yes and then offers $1 and the payoffs are \((2, p_1 - 2, 0)\) (and the choices off the equilibrium path are as explained above);

(3) if p = 2 then there are two equilibria: one as in (1) and the other as in (2).


(e) When the loser is given the fraction a of the amount paid by the winner (that is, the loser is given the fraction a of his own bid), it is no longer true that bidding one’s true value is a dominant strategy. In fact, \((p_1, p_2)\) is not even a Nash equilibrium any more. To see this, imagine that Player 1’s true value is 10 and Player 2’s true value is 6 and a = 50%. Then if Player 1 bids 10 and 2 bids 6, Player 2 ends up losing the auction but being given $3, while if he increased his bid to 8 then he would still lose the auction but receive $4. This shows that there cannot be a Nash equilibrium where Player 2 bids less than Player 1. Now there are several Nash equilibria of the auction, for example, all pairs \((b_1, b_2)\) with \(b_1 = b_2 = b\) and \(p_2 < b < p_1\) provided that \(p_1 - b > a(b - 1)\), that is, \(b < \frac{p_1 + a}{1 + a}\) (but there are more: for example all pairs \((b_1, b_2)\) with \(b_1 = b_2 = b\) and \(b < p_2\) provided that \(p_1 - b > a(b - 1)\) and \(ab > p_2 - b\)). Thus to find a subgame-perfect equilibrium of the game one first has to select a Nash equilibrium of the auction game and then apply backward induction to see if the players would want to say Yes or No to the auction, etc.

(f)  Let  us  start  by  considering  the  perfect-information  game  that  is  played  if  the 
Chinese  says  Yes.  This  is  a game  similar  to  the  one  discussed  in  Example  2.2 
(Chapter  2,  Section  2.5).  We  first  determine  the  losing  positions.  Whoever 
has  to  move  when  the  sum  is  36  cannot  win.  Thus  36  is  a losing  position. 
Working  backwards,  the  losing  positions  are  32,  28,  24,  20,  16,  12,  8,  4 and 
0.  Thus  the  first  player  (=  player  G)  starts  from  a losing  position:  whatever 
his  initial  choice,  he  can  be  made  to  choose  the  second  time  when  the  sum  is 
4,  and  then  8,  etc.  Hence  the  second  player  (=  the  Chinese)  has  a 
winning  strategy,  which  is  as  follows:  if  Player  G just  chose  n,  then  choose 
(4  -n).  If  the  Chinese  says  Yes  and  then  follows  this  strategy  he  can 
guarantee  that  he  will  buy  the  palace  for  $50.  Thus  the  subgame-perfect 
equilibrium  of  this  game  is:  the  Chinese  says  Yes  and  uses  the  winning 
strategy  in  the  ensuing  game,  while  for  Player  G we  can  pick  any  arbitrary 
choices. 
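
The backward reasoning that identifies the losing positions can be checked mechanically. The sketch below rests on assumptions suggested by the solution rather than stated in it: each player in turn adds 1, 2 or 3 to the running sum, and whoever brings the sum to exactly 40 wins. The function name `losing_positions` and its parameters are illustrative.

```python
def losing_positions(target, moves):
    """Sums from which the player about to move cannot force a win.

    A position is winning if some move either reaches `target` exactly or
    leads to a position that is losing for the opponent.
    """
    wins = {}
    for s in range(target - 1, -1, -1):      # work backwards from the end
        wins[s] = any(
            s + m == target or (s + m < target and not wins[s + m])
            for m in moves
        )
    return [s for s in range(target) if not wins[s]]


print(losing_positions(40, (1, 2, 3)))
```

Under these assumptions the starting sum 0 is itself a losing position for the first mover, and replying (4 − n) to a choice of n always returns the opponent to a multiple of 4.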
Chapter 4


Expected Utility


4.1 Money lotteries and attitudes to risk


At the end of Chapter 3 we showed how to incorporate random events in extensive-form games by means of chance moves. The introduction of chance moves gives rise to probabilistic outcomes, which we called lotteries. In Chapter 3 we restricted attention to lotteries whose outcomes are sums of money (money lotteries) and to one possible way of ranking such lotteries, based on the notion of risk neutrality. In this section we will continue to focus on money lotteries and define other possible attitudes to risk.

Throughout this chapter we will restrict attention to finite lotteries. Recall that a money lottery is a probability distribution of the form

\[ \begin{pmatrix} \$x_1 & \$x_2 & \dots & \$x_n \\ p_1 & p_2 & \dots & p_n \end{pmatrix} \]

(\(0 \le p_i \le 1\), for all \(i = 1, 2, \dots, n\) and \(p_1 + p_2 + \dots + p_n = 1\))

and that (Definition 3.13, Chapter 3) its expected value is the number \(x_1 p_1 + x_2 p_2 + \dots + x_n p_n\). If L is a money lottery, we will denote by E[L] the expected value of L. Thus, for example, if

\[ L = \begin{pmatrix} \$30 & \$45 & \$90 \\ \tfrac{1}{3} & \tfrac{5}{9} & \tfrac{1}{9} \end{pmatrix} \]

then \(E[L] = \tfrac{1}{3}(30) + \tfrac{5}{9}(45) + \tfrac{1}{9}(90) = 45\). Recall also (Definition 3.14, Chapter 3) that a person is said to be risk neutral if she considers a money lottery to be just as good as its expected value. For example, a risk-neutral person would consider getting $45 with certainty to be just as good as playing the lottery L above. We can now consider different attitudes to risk, besides risk neutrality.

1 In the next section we will consider more general lotteries, where the outcomes need not be sums of money, and also introduce the theory of expected utility which will then be used in the context of games in Chapters 5 and 6.


Definition 4.1. Let L be a money lottery and consider the choice between L and the degenerate lottery \(\begin{pmatrix} \$E[L] \\ 1 \end{pmatrix}\) (that is, the choice between facing the lottery L or getting the expected value of L with certainty). Then

• An individual who prefers $E[L] for certain to L is said to be risk averse.

• An individual who is indifferent between $E[L] for certain and L is said to be risk neutral.

• An individual who prefers L to $E[L] for certain is said to be risk loving.
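
The expected value that Definition 4.1 compares with the lottery itself is a probability-weighted sum, which the following minimal sketch computes for the lottery ($30, $45, $90) with probabilities (1/3, 5/9, 1/9). The representation of a lottery as a list of (prize, probability) pairs and the function name `expected_value` are illustrative choices.

```python
from fractions import Fraction


def expected_value(lottery):
    """Expected value of a money lottery given as (prize, probability) pairs."""
    assert sum(p for _, p in lottery) == 1, "probabilities must sum to 1"
    return sum(x * p for x, p in lottery)


L = [(30, Fraction(1, 3)), (45, Fraction(5, 9)), (90, Fraction(1, 9))]
print(expected_value(L))  # 45
```

Exact fractions are used so that the sum-to-one check is not affected by rounding.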


Note the following: if an individual is risk neutral, has transitive preferences over money lotteries and prefers more money to less, then we can tell how that individual ranks any two money lotteries. For example, how would such an individual rank the two lotteries

\[ L_1 = \begin{pmatrix} \$30 & \$45 & \$90 \\ \tfrac{1}{3} & \tfrac{5}{9} & \tfrac{1}{9} \end{pmatrix} \quad\text{and}\quad L_2 = \begin{pmatrix} \$5 & \$100 \\ \tfrac{3}{5} & \tfrac{2}{5} \end{pmatrix} ? \]

Since \(E[L_1] = 45\) and the individual is risk neutral, \(L_1 \sim \$45\); since \(E[L_2] = 43\) and the individual is risk neutral, \(\$43 \sim L_2\); since the individual prefers more money to less, \(\$45 \succ \$43\); thus, by transitivity, \(L_1 \succ L_2\). On the other hand, knowing that an individual is risk averse, has transitive preferences over money lotteries and prefers more money to less is not sufficient to predict how she will choose between two arbitrary money lotteries. For example, as we will see later (Exercise 4.11), it is possible that one such individual will prefer \(L_3 = \begin{pmatrix} \$28 \\ 1 \end{pmatrix}\) (whose expected value is 28) to \(L_4 = \begin{pmatrix} \$10 & \$50 \\ \tfrac{1}{2} & \tfrac{1}{2} \end{pmatrix}\) (whose expected value is 30), while another such individual will prefer \(L_4\) to \(L_3\). Similarly, knowing that an individual is risk loving, has transitive preferences over money lotteries and prefers more money to less is not sufficient to predict how she will choose between two arbitrary money lotteries.
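
The contrast between the two cases can be illustrated numerically. The sketch below first ranks \(L_1\) and \(L_2\) by expected value, as a risk-neutral individual would, and then compares \(L_3\) and \(L_4\) for two hypothetical risk-averse individuals. Modelling risk aversion through the concave functions √x and x^0.9 anticipates the expected-utility theory of Section 4.2 and is an assumption made here purely for illustration; neither function comes from the text.

```python
import math
from fractions import Fraction


def expected_value(lottery):
    """Expected value of a list of (prize, probability) pairs."""
    return sum(x * p for x, p in lottery)


def expected_utility(lottery, u):
    """Probability-weighted average of u(prize); u is an assumed utility."""
    return sum(float(p) * u(x) for x, p in lottery)


L1 = [(30, Fraction(1, 3)), (45, Fraction(5, 9)), (90, Fraction(1, 9))]
L2 = [(5, Fraction(3, 5)), (100, Fraction(2, 5))]
L3 = [(28, Fraction(1))]
L4 = [(10, Fraction(1, 2)), (50, Fraction(1, 2))]

# Risk neutrality: the ranking follows the expected values.
print(expected_value(L1), expected_value(L2))

# Two hypothetical risk-averse individuals (both functions are concave).
for name, u in [("sqrt", math.sqrt), ("x**0.9", lambda x: x ** 0.9)]:
    better = "L3" if expected_utility(L3, u) > expected_utility(L4, u) else "L4"
    print(name, "prefers", better)
```

The two individuals disagree on \(L_3\) versus \(L_4\) even though both are risk averse, which is why risk aversion alone does not pin down the choice.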
Remark 4.1. Note that “rationality” does not and cannot dictate whether an
individual  should  be  risk  neutral,  risk  averse  or  risk  loving:  an  individual’s 
attitude  to  risk  is  merely  a reflection  of  that  individual’s  preferences.  It  is  a 
generally  accepted  principle  that  de  gustibus  non  est  disputandum  (in  matters  of 
taste,  there  can  be  no  disputes).  According  to  this  principle,  there  is  no  such 
thing  as  an  irrational  preference  and  thus  there  is  no  such  thing  as  an  irrational 
attitude  to  risk.  As  a matter  of  fact,  most  people  reveal  through  their  choices 
(e.g.  the  decision  to  buy  insurance)  that  they  are  risk  averse,  at  least  when  the 
stakes  are  high. 

As  noted  above,  with  the  exception  of  risk-neutral  individuals,  even  if  we 
restrict  attention  to  money  lotteries  we  are  not  able  to  say  much  about  how  an 
individual  would  choose  among  lotteries.  What  we  need  is  a theory  of 
“rational”  preferences  over  lotteries  that  is  general  enough  to  cover  lotteries 
whose  outcomes  are  not  just  sums  of  money  and  is  capable  of  accounting  for 
different  attitudes  to  risk  in  the  case  of  money  lotteries.  One  such  theory  is  the 
theory  of  expected  utility,  to  which  we  now  turn. 

Test  your  understanding  of  the  concepts  introduced  in  this  section,  by  going 
through  the  exercises  in  Section  4.E.1  of  Appendix  4.E  at  the  end  of  this  chapter. 


4.2  Expected  utility:  theorems 

The  theory  of  expected  utility  was  developed  by  the  founders  of  game  theory, 
namely  John  Von  Neumann  and  Oskar  Morgenstern,  in  their  1944  book 
Theory  of  Games  and  Economic  Behavior.  In  a rather  unconventional  way,  we 
shall  first  (in  this  section)  state  the  main  result  of  the  theory  (which  we  split 
into  two  theorems)  and  then  (in  the  following  section)  explain  the  assumptions 
(or  axioms)  behind  that  result.  The  reader  who  is  not  interested  in 
understanding  the  conceptual  foundation  of  expected  utility  theory,  but  wants 
to  understand  what  the  theory  says  and  how  it  can  be  used,  can  thus  study  this 
section  and  skip  the  next.  First  we  review  the  notation. 

Let O be a set of basic outcomes. Note that a basic outcome need not be a sum of money: it could be the state of an individual’s health, or whether the individual under consideration receives an award, or whether it will rain on the day of her planned outdoor party, etc. Let \(\mathcal{L}(O)\) be the set of simple lotteries (or probability distributions) over O. We will assume throughout that O is a finite set: \(O = \{o_1, o_2, \dots, o_m\}\). Thus an element of \(\mathcal{L}(O)\) is of the form

\[ \begin{pmatrix} o_1 & o_2 & \dots & o_m \\ p_1 & p_2 & \dots & p_m \end{pmatrix} \]

with \(0 \le p_i \le 1\), for all \(i = 1, 2, \dots, m\) and \(p_1 + p_2 + \dots + p_m = 1\).


We will use the symbol L (with or without subscript) to denote an element of \(\mathcal{L}(O)\), that is, a simple lottery. Lotteries are used to represent situations of uncertainty. For example, if m = 4 and the individual faces the lottery

\[ L = \begin{pmatrix} o_1 & o_2 & o_3 & o_4 \\ \tfrac{2}{5} & 0 & \tfrac{1}{5} & \tfrac{2}{5} \end{pmatrix} \]

then she knows that, eventually, the outcome will be one and only one of \(o_1, o_2, o_3, o_4\), but does not know which one; furthermore, she is able to quantify her uncertainty by assigning probabilities to these outcomes.
We  take  these  probabilities  either  as  objectively  obtained  from  relevant  past 
data  or  as  subjective  estimates  by  the  individual.  For  example,  an  individual 
who  is  considering  whether  or  not  to  insure  her  bicycle  against  theft  for  the 
following  12  months,  knows  that  there  are  two  relevant  basic  outcomes:  either 
the  bicycle  will  be  stolen  or  it  will  not  be  stolen.  Furthermore,  she  can  look  up 
data  about  past  bicycle  thefts  in  her  area  and  use  the  proportion  of  bicycles  that 
was  stolen  as  an  “objective”  estimate  of  the  probability  that  her  bicycle  will  be 
stolen;  alternatively,  she  can  use  a more  subjective  estimate:  for  example  she 
might  use  a lower  probability  of  theft  than  suggested  by  the  data  because  she 
knows  herself  to  be  very  conscientious  and  - unlike  other  people  - to  always 
lock  her  bicycle  when  left  unattended. 


The  assignment  of  zero  probability  to  a particular  basic  outcome  is  taken  to  be 
an  expression  of  belief  not  of  impossibility:  the  individual  is  confident  that  the 
outcome  will  not  arise,  but  she  cannot  rule  out  that  outcome  on  logical 
grounds  or  by  appealing  to  the  laws  of  nature. 


Among the elements of \(\mathcal{L}(O)\) there are the degenerate lotteries that assign probability 1 to one basic outcome: for example, if m = 4 one degenerate lottery is

\[ \begin{pmatrix} o_1 & o_2 & o_3 & o_4 \\ 0 & 0 & 1 & 0 \end{pmatrix}. \]

To simplify the notation we will often denote degenerate lotteries as basic outcomes, that is, instead of writing \(\begin{pmatrix} o_1 & o_2 & o_3 & o_4 \\ 0 & 0 & 1 & 0 \end{pmatrix}\) we will simply write \(o_3\). Thus, in general, the degenerate lottery

\[ \begin{pmatrix} o_1 & \dots & o_{i-1} & o_i & o_{i+1} & \dots & o_m \\ 0 & \dots & 0 & 1 & 0 & \dots & 0 \end{pmatrix} \]

will be denoted by \(o_i\). As another simplification, we will often omit those outcomes that are assigned zero probability. For example, if m = 4, the lottery \(\begin{pmatrix} o_1 & o_2 & o_3 & o_4 \\ \tfrac{1}{3} & 0 & \tfrac{2}{3} & 0 \end{pmatrix}\) will be written more simply as \(\begin{pmatrix} o_1 & o_3 \\ \tfrac{1}{3} & \tfrac{2}{3} \end{pmatrix}\).

Throughout this chapter we shall call the individual under consideration the Decision-Maker, or DM for short. The theory of expected utility assumes that the DM has a complete and transitive ranking \(\succsim\) of the elements of \(\mathcal{L}(O)\) (indeed, this is one of the axioms listed in the next section). By completeness, given any two lotteries \(L\) and \(L'\), either \(L \succ L'\) (the DM prefers \(L\) to \(L'\)) or \(L' \succ L\) (the DM prefers \(L'\) to \(L\)) or \(L \sim L'\) (the DM is indifferent between \(L\) and \(L'\)). Furthermore, by transitivity, for any three lotteries \(L_1\), \(L_2\) and \(L_3\), if \(L_1 \succsim L_2\) (the DM considers \(L_1\) to be at least as good as \(L_2\)) and \(L_2 \succsim L_3\) then \(L_1 \succsim L_3\). Besides completeness and transitivity, a number of other “rationality” constraints are postulated on the ranking \(\succsim\) of the elements of \(\mathcal{L}(O)\); these constraints are the so-called Expected Utility Axioms and are discussed in the next section.


Definition 4.2. A ranking \(\succsim\) of the elements of \(\mathcal{L}(O)\) that satisfies the Expected Utility Axioms is called a von Neumann-Morgenstern ranking.

The  following  two  theorems  are  the  key  results  in  the  theory  of  expected 
utility. 


Theorem 4.1 [von Neumann-Morgenstern, 1944]. Let \(O = \{o_1, o_2, \dots, o_m\}\) be a set of basic outcomes and \(\mathcal{L}(O)\) the set of lotteries over \(O\). If \(\succsim\) is a von Neumann-Morgenstern ranking of the elements of \(\mathcal{L}(O)\) then there exists a function \(U : O \to \mathbb{R}\), called a von Neumann-Morgenstern utility function, that assigns a number (called utility) to every basic outcome and is such that, for any two lotteries

\[
L = \begin{pmatrix} o_1 & o_2 & \dots & o_m \\ p_1 & p_2 & \dots & p_m \end{pmatrix}
\quad\text{and}\quad
L' = \begin{pmatrix} o_1 & o_2 & \dots & o_m \\ q_1 & q_2 & \dots & q_m \end{pmatrix},
\]

\[
L \succ L' \text{ if and only if } E[U(L)] > E[U(L')] \quad\text{and}\quad L \sim L' \text{ if and only if } E[U(L)] = E[U(L')]
\]

where

\[
U(L) = \begin{pmatrix} U(o_1) & U(o_2) & \dots & U(o_m) \\ p_1 & p_2 & \dots & p_m \end{pmatrix},
\quad
U(L') = \begin{pmatrix} U(o_1) & U(o_2) & \dots & U(o_m) \\ q_1 & q_2 & \dots & q_m \end{pmatrix},
\]

\(E[U(L)]\) is the expected value of the lottery \(U(L)\) and \(E[U(L')]\) is the expected value of the lottery \(U(L')\), that is,

\[
E[U(L)] = p_1 U(o_1) + p_2 U(o_2) + \dots + p_m U(o_m)
\]

\[
E[U(L')] = q_1 U(o_1) + q_2 U(o_2) + \dots + q_m U(o_m).
\]

\(E[U(L)]\) is called the expected utility of lottery \(L\) (and \(E[U(L')]\) the expected utility of lottery \(L'\)).

We say that any function \(U : O \to \mathbb{R}\) that satisfies the property that, for any two lotteries \(L\) and \(L'\), \(L \succsim L'\) if and only if \(E[U(L)] \geq E[U(L')]\) represents the preferences (or ranking) \(\succsim\).

Before we comment on Theorem 4.1 we give an example of how one can use it. Theorem 4.1 sometimes allows us to predict an individual's choice between two lotteries C and D if we know how that individual ranks two different lotteries A and B. For example, suppose we observe that Susan is faced with the choice between lotteries A and B below and she says that she prefers A to B:

\[
A = \begin{pmatrix} o_1 & o_2 & o_3 \\ 0 & 0.25 & 0.75 \end{pmatrix}
\qquad
B = \begin{pmatrix} o_1 & o_2 & o_3 \\ 0.2 & 0 & 0.8 \end{pmatrix}
\]

With this information we can predict which of the following two lotteries C and D she will choose, if she has von Neumann-Morgenstern preferences:

\[
C = \begin{pmatrix} o_1 & o_2 & o_3 \\ 0.8 & 0 & 0.2 \end{pmatrix}
\qquad
D = \begin{pmatrix} o_1 & o_2 & o_3 \\ 0 & 1 & 0 \end{pmatrix}
\]

Applying the theorem, let \(U\) be a von Neumann-Morgenstern utility function whose existence is guaranteed by Theorem 4.1. Let \(U(o_1) = a\), \(U(o_2) = b\) and \(U(o_3) = c\) (where \(a\), \(b\) and \(c\) are numbers). Then, since Susan prefers A to B, the expected utility of A must be greater than the expected utility of B: \(0.25b + 0.75c > 0.2a + 0.8c\). This inequality is equivalent to \(0.25b > 0.2a + 0.05c\) or, dividing both sides by 0.25, \(b > 0.8a + 0.2c\). It follows from this and Theorem 4.1 that Susan prefers D to C, because the expected utility of D is \(b\) and the expected utility of C is \(0.8a + 0.2c\). Note that, in this example, we merely used the fact that a von Neumann-Morgenstern utility function exists, even though we don't know what the values of this function are.
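
The argument can also be checked numerically. The following Python sketch is only an illustration under stated assumptions: the helper names (`expected_utility`, `check_prediction`) and the random sampling of utility values are not part of the text. The probability vectors are those of lotteries A, B, C and D in the example: A = (0, 0.25, 0.75), B = (0.2, 0, 0.8), C = (0.8, 0, 0.2), D = (0, 1, 0). The code draws many candidate utility vectors (a, b, c) and confirms that every vector which ranks A above B also ranks D above C.

```python
import random

# Lotteries over (o1, o2, o3), written as probability vectors.
A = (0.0, 0.25, 0.75)
B = (0.2, 0.0, 0.8)
C = (0.8, 0.0, 0.2)
D = (0.0, 1.0, 0.0)


def expected_utility(lottery, utility):
    """Expected utility of a lottery: sum of p_i * U(o_i)."""
    return sum(p * u for p, u in zip(lottery, utility))


def check_prediction(trials=10_000, seed=0):
    """Sample utility vectors (a, b, c) at random (an assumption made for
    illustration). Whenever A is strictly better than B, D must be
    strictly better than C. Returns (all_consistent, number_checked)."""
    rng = random.Random(seed)
    checked = 0
    for _ in range(trials):
        utility = tuple(rng.uniform(-10, 10) for _ in range(3))
        # Small margin so that floating-point noise cannot flip the comparison.
        if expected_utility(A, utility) > expected_utility(B, utility) + 1e-9:
            checked += 1
            if not expected_utility(D, utility) > expected_utility(C, utility):
                return False, checked
    return True, checked


ok, n_checked = check_prediction()
print(ok, n_checked)
```

The check never needs to know Susan's actual utility values, which mirrors the point of the example: existence of a representing function is enough.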

Theorem 4.1 is an example of a representation theorem and is a generalization of a similar result for the case of the ranking of a finite set of basic outcomes \(O\). It is not difficult to prove that if \(\succsim\) is a complete and transitive ranking of \(O\) then there exists a function \(U : O \to \mathbb{R}\), called a utility function (see Chapter 1), such that, for any two basic outcomes \(o, o' \in O\), \(U(o) \geq U(o')\) if and only if \(o \succsim o'\). Now, it is quite possible that an individual has a complete and transitive ranking of \(O\), is fully aware of her ranking and yet she is not able to answer the question "what is your utility function?", perhaps because she has never heard about utility functions. A utility function is a tool that we use to represent her ranking, nothing more than that. The same applies to von Neumann-Morgenstern rankings: Theorem 4.1 tells us that if an individual has a von Neumann-Morgenstern ranking of the set of lotteries \(\mathcal{L}(O)\) then there exists a von Neumann-Morgenstern utility function that we can use to represent her preferences, but it would not make sense for us to ask the individual "what is your von Neumann-Morgenstern utility function?" (indeed this was a question that could not even be conceived before von Neumann and Morgenstern stated and proved Theorem 4.1 in 1944!). Theorem 4.1 tells us that such a function exists. The next theorem can be used to actually construct such a function by asking the individual to answer a few questions, formulated in a way that is fully comprehensible to her (that is, without using the word 'utility').

The  next  theorem  says  that,  although  there  are  many  utility  functions  that 
represent  a given  von  Neumann-Morgenstern  ranking,  once  you  know  one 
function  you  know  them  all,  in  the  sense  that  there  is  a simple  operation  that 
takes  you  from  one  function  to  the  other. 

Theorem 4.2 [von Neumann-Morgenstern, 1944]. Let \(\succsim\) be a von Neumann-Morgenstern ranking of the set of basic lotteries \(\mathcal{L}(O)\), where \(O = \{o_1, o_2, \dots, o_m\}\). Then the following are true.

(A) If \(U : O \to \mathbb{R}\) is a von Neumann-Morgenstern utility function that represents \(\succsim\), then, for any two real numbers \(a\) and \(b\) with \(a > 0\), the function \(V : O \to \mathbb{R}\) defined by \(V(o_i) = aU(o_i) + b\) (\(i = 1, 2, \dots, m\)) is also a von Neumann-Morgenstern utility function that represents \(\succsim\).

(B) If \(U : O \to \mathbb{R}\) and \(V : O \to \mathbb{R}\) are two von Neumann-Morgenstern utility functions that represent \(\succsim\), then there exist two real numbers \(a\) and \(b\) with \(a > 0\) such that \(V(o_i) = aU(o_i) + b\) (\(i = 1, 2, \dots, m\)).


Proof. The proof of (A) of Theorem 4.2 is very simple. Let \(a\) and \(b\) be two numbers, with \(a > 0\). The hypothesis is that \(U : O \to \mathbb{R}\) is a von Neumann-Morgenstern utility function that represents \(\succsim\), that is, that, for any two lotteries

\[
L = \begin{pmatrix} o_1 & \dots & o_m \\ p_1 & \dots & p_m \end{pmatrix}
\quad\text{and}\quad
L' = \begin{pmatrix} o_1 & \dots & o_m \\ q_1 & \dots & q_m \end{pmatrix},
\]

\(L \succsim L'\) if and only if

\[
p_1 U(o_1) + \dots + p_m U(o_m) \geq q_1 U(o_1) + \dots + q_m U(o_m) \qquad (*)
\]

Multiplying both sides of the above inequality by \(a > 0\) and adding \(b\) to both sides (written as \((p_1 + \dots + p_m)b\) on the left and \((q_1 + \dots + q_m)b\) on the right, since each set of probabilities adds up to 1) we obtain

\[
p_1 U(o_1) + \dots + p_m U(o_m) \geq q_1 U(o_1) + \dots + q_m U(o_m)
\]

if and only if

\[
p_1 [aU(o_1) + b] + \dots + p_m [aU(o_m) + b] \geq q_1 [aU(o_1) + b] + \dots + q_m [aU(o_m) + b] \qquad (\blacklozenge)
\]

Defining \(V(o_i) = aU(o_i) + b\), it follows from \((*)\) and \((\blacklozenge)\) that

\(L \succsim L'\) if and only if

\[
p_1 V(o_1) + \dots + p_m V(o_m) \geq q_1 V(o_1) + \dots + q_m V(o_m)
\]

That is, the function \(V\) is a von Neumann-Morgenstern utility function that represents the ranking \(\succsim\).

The proof of Part (B) will be given later, after introducing some more notation and some observations. ■

Since among the lotteries there are the degenerate ones that assign probability 1 to a single basic outcome, this implies that the DM has a complete and transitive ranking of the basic outcomes. We shall write \(o_{best}\) for a best basic outcome, that is, a basic outcome which is at least as good as any other outcome (\(o_{best} \succsim o\), for every \(o \in O\)) and \(o_{worst}\) for a worst basic outcome, that is, a basic outcome such that every other outcome is at least as good as it (\(o \succsim o_{worst}\), for every \(o \in O\)). Note that there may be several best outcomes (then the DM would be indifferent among them) and several worst outcomes. We shall assume throughout that the DM is not indifferent among all the outcomes, that is, we shall assume that \(o_{best} \succ o_{worst}\).

We now show that, in virtue of Theorem 4.2, among the von Neumann-Morgenstern utility functions that represent a given von Neumann-Morgenstern ranking \(\succsim\) of \(\mathcal{L}(O)\), there is one that assigns the value 1 to the best basic outcome(s) and the value 0 to the worst basic outcome(s). To see this, consider an arbitrary von Neumann-Morgenstern utility function \(F : O \to \mathbb{R}\) that represents \(\succsim\) and define \(G : O \to \mathbb{R}\) as follows: for every \(o \in O\), \(G(o) = F(o) - F(o_{worst})\). Then, by Theorem 4.2, \(G\) is also a utility function that represents \(\succsim\) and, by construction, \(G(o_{worst}) = F(o_{worst}) - F(o_{worst}) = 0\); note also that, since \(o_{best} \succ o_{worst}\), it follows that \(G(o_{best}) > 0\). Finally, define \(U : O \to \mathbb{R}\) as follows: for every \(o \in O\),

\[
U(o) = \frac{G(o)}{G(o_{best})}.
\]

Then, by Theorem 4.2, \(U\) is a utility function that represents \(\succsim\) and, by construction, \(U(o_{worst}) = 0\) and \(U(o_{best}) = 1\). For example, if there are six basic outcomes and the ranking of the basic outcomes is \(o_3 \sim o_6 \succ o_1 \succ o_4 \succ o_2 \sim o_5\), then one can take as \(o_{best}\) either \(o_3\) or \(o_6\) and as \(o_{worst}\) either \(o_2\) or \(o_5\); furthermore, if \(F\) is given by

outcome    o1    o2    o3    o4    o5    o6
F           2    -2     8     0    -2     8

then \(G\) is the function

outcome    o1    o2    o3    o4    o5    o6
G           4     0    10     2     0    10

and \(U\) is the function

outcome    o1    o2    o3    o4    o5    o6
U         0.4     0     1   0.2     0     1
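
The two-step normalization (subtract the utility of a worst outcome, then divide by the shifted utility of a best outcome) can be sketched in a few lines of Python. This is a minimal sketch, not part of the text: the function name `normalize` and the dictionary representation of a utility function are assumptions made for illustration, and exact fractions are used so that the six-outcome example (F = 2, -2, 8, 0, -2, 8) is reproduced without rounding.

```python
from fractions import Fraction


def normalize(utility):
    """Normalize a utility function given as a dict {outcome: number}.

    Step 1: G(o) = F(o) - F(o_worst), so a worst outcome gets 0.
    Step 2: U(o) = G(o) / G(o_best), so a best outcome gets 1.
    """
    worst_value = Fraction(min(utility.values()))
    shifted = {o: Fraction(v) - worst_value for o, v in utility.items()}
    best_shifted = max(shifted.values())
    if best_shifted == 0:
        # The text assumes o_best is strictly preferred to o_worst.
        raise ValueError("the DM is indifferent among all outcomes")
    return {o: g / best_shifted for o, g in shifted.items()}


# The six-outcome example: F = (2, -2, 8, 0, -2, 8).
F = {"o1": 2, "o2": -2, "o3": 8, "o4": 0, "o5": -2, "o6": 8}
U = normalize(F)
print({o: float(u) for o, u in U.items()})
```

Both steps are positive affine transformations, which is why Theorem 4.2 (A) guarantees that the result still represents the same ranking.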

Definition 4.3. Let \(U : O \to \mathbb{R}\) be a utility function that represents a given von Neumann-Morgenstern ranking \(\succsim\) of the set of lotteries \(\mathcal{L}(O)\). We say that \(U\) is the normalized utility function if \(U(o_{worst}) = 0\) and \(U(o_{best}) = 1\).

The  transformations  described  above  show  how  to  normalize  any  given  utility 
function. 

Armed  with  the  notion  of  normalized  utility  function  we  can  now  complete 
the  proof  of  Theorem  4.2. 

Proof of Part (B) of Theorem 4.2. Let \(F : O \to \mathbb{R}\) and \(G : O \to \mathbb{R}\) be two von Neumann-Morgenstern utility functions that represent a given von Neumann-Morgenstern ranking of \(\mathcal{L}(O)\). Let \(U : O \to \mathbb{R}\) be the normalization of \(F\) and \(V : O \to \mathbb{R}\) be the normalization of \(G\). First we show that it must be that \(U = V\), that is, \(U(o) = V(o)\) for every \(o \in O\). Suppose, by contradiction, that there is an \(\hat{o} \in O\) such that \(U(\hat{o}) \neq V(\hat{o})\). Without loss of generality we can assume that \(U(\hat{o}) > V(\hat{o})\). Construct the following lottery:

\[
L = \begin{pmatrix} o_{best} & o_{worst} \\ \hat{p} & 1 - \hat{p} \end{pmatrix}
\quad\text{with } \hat{p} = U(\hat{o}).
\]

Then \(E[U(L)] = E[V(L)] = U(\hat{o})\). Hence, according to \(U\) it must be that \(\hat{o} \sim L\) (this follows from Theorem 4.1), while according to \(V\) it must be (again, by Theorem 4.1) that \(L \succ \hat{o}\) (since \(E[V(L)] = U(\hat{o}) > V(\hat{o})\)). Then \(U\) and \(V\) cannot be two representations of the same ranking.

Now let

\[
a_1 = \frac{1}{F(o_{best}) - F(o_{worst})}
\quad\text{and}\quad
b_1 = -\frac{F(o_{worst})}{F(o_{best}) - F(o_{worst})}.
\]

Note that \(a_1 > 0\). Then it is easy to verify that, for every \(o \in O\), \(U(o) = a_1 F(o) + b_1\). Similarly let

\[
a_2 = \frac{1}{G(o_{best}) - G(o_{worst})}
\quad\text{and}\quad
b_2 = -\frac{G(o_{worst})}{G(o_{best}) - G(o_{worst})};
\]

again, \(a_2 > 0\) and, for every \(o \in O\), \(V(o) = a_2 G(o) + b_2\). We can invert the latter transformation and obtain that, for every \(o \in O\), \(G(o) = \frac{V(o)}{a_2} - \frac{b_2}{a_2}\). Thus we can transform \(F\) into \(U\), which - as proved above - is the same as \(V\), and then transform \(V\) into \(G\) as follows: \(G(o) = aF(o) + b\) where \(a = \frac{a_1}{a_2} > 0\) and \(b = \frac{b_1 - b_2}{a_2}\). ■
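
The construction used in the proof of Part (B) can be traced with concrete numbers. The sketch below is illustrative only: the function names are assumptions, and the second utility function `G` is a hypothetical one, built here as 3F + 5 so that the expected answer (a = 3, b = 5) is known in advance. It computes the normalizing coefficients of each function and combines them as a = a1/a2 and b = (b1 - b2)/a2.

```python
from fractions import Fraction


def normalizing_coefficients(utility):
    """Return (a, b), a > 0, such that a * utility(o) + b is the normalized
    utility: a = 1 / (best - worst), b = -worst / (best - worst)."""
    best = Fraction(max(utility.values()))
    worst = Fraction(min(utility.values()))
    spread = best - worst
    if spread == 0:
        raise ValueError("the DM is indifferent among all outcomes")
    return 1 / spread, -worst / spread


def affine_link(F, G):
    """Return (a, b) with G(o) = a * F(o) + b, assuming F and G represent
    the same von Neumann-Morgenstern ranking (so their normalizations agree)."""
    a1, b1 = normalizing_coefficients(F)
    a2, b2 = normalizing_coefficients(G)
    return a1 / a2, (b1 - b2) / a2


F = {"o1": 2, "o2": -2, "o3": 8, "o4": 0, "o5": -2, "o6": 8}
G = {o: 3 * v + 5 for o, v in F.items()}  # hypothetical second representation

a, b = affine_link(F, G)
print(a, b)
```

If the two input functions did not represent the same ranking, the returned pair would not reproduce `G` from `F`, so a check such as `all(G[o] == a * F[o] + b for o in F)` is a cheap way to test that assumption.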


Remark 4.2. Theorem 4.2 is often stated as follows: a utility function that represents a von Neumann-Morgenstern ranking \(\succsim\) of \(\mathcal{L}(O)\) is unique up to a positive affine transformation.* Because of Theorem 4.2, a von Neumann-Morgenstern utility function is usually referred to as a cardinal utility function.

* An affine transformation is a function \(f : \mathbb{R} \to \mathbb{R}\) of the form \(f(x) = ax + b\) with \(a, b \in \mathbb{R}\). The affine transformation is positive if \(a > 0\).

Theorem 4.1 guarantees the existence of a utility function that represents a given von Neumann-Morgenstern ranking \(\succsim\) of \(\mathcal{L}(O)\) and Theorem 4.2 characterizes the set of such functions. Can one actually construct a utility function that represents a given ranking? The answer is affirmative: if there are \(m\) basic outcomes one can construct an individual's von Neumann-Morgenstern utility function by asking her at most \((m - 1)\) questions. The first question is "what is your ranking of the basic outcomes?". Then we can construct the normalized utility function by first assigning the value 1 to the best outcome(s) and the value 0 to the worst outcome(s). This leaves us with at most \((m - 2)\) values to determine. For this we appeal to one of the axioms discussed in the next section, namely the Continuity Axiom, which says that, for every basic outcome \(o_i\) there is a probability \(p_i \in [0, 1]\) such that the DM is indifferent between \(o_i\) for certain and the lottery that gives a best outcome with probability \(p_i\) and a worst outcome with probability \((1 - p_i)\):

\[
o_i \sim \begin{pmatrix} o_{best} & o_{worst} \\ p_i & 1 - p_i \end{pmatrix}.
\]

Thus, for each basic outcome \(o_i\) for which a utility has not been determined yet, we should ask the individual to tell us the value of \(p_i\) such that \(o_i \sim \begin{pmatrix} o_{best} & o_{worst} \\ p_i & 1 - p_i \end{pmatrix}\); then we can set \(U(o_i) = p_i\), because the expected utility of the lottery \(\begin{pmatrix} o_{best} & o_{worst} \\ p_i & 1 - p_i \end{pmatrix}\) is \(p_i U(o_{best}) + (1 - p_i) U(o_{worst}) = p_i(1) + (1 - p_i)0 = p_i\).


Example 4.1. Suppose that there are five basic outcomes, that is, \(O = \{o_1, o_2, o_3, o_4, o_5\}\) and the DM, who has von Neumann-Morgenstern preferences, tells us that her ranking of the basic outcomes is as follows:

\[
o_2 \succ o_1 \sim o_5 \succ o_3 \sim o_4.
\]

Then we can begin by assigning utility 1 to the best outcome \(o_2\) and utility 0 to the worst outcomes \(o_3\) and \(o_4\):

outcome    o1    o2    o3    o4    o5
utility     ?     1     0     0     ?

There is only one value left to be determined, namely the utility of \(o_1\) (which is also the utility of \(o_5\), since \(o_1 \sim o_5\)). To find this value, we ask the DM to tell us what value of \(p\) makes her indifferent between the lottery \(L = \begin{pmatrix} o_2 & o_3 \\ p & 1 - p \end{pmatrix}\) and outcome \(o_1\) with certainty. Suppose that her answer is: 0.4. Then her normalized von Neumann-Morgenstern utility function is

outcome    o1    o2    o3    o4    o5
utility   0.4     1     0     0   0.4

Knowing this, we can predict her choice among any set of lotteries over the five basic outcomes.
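
The elicitation procedure of Example 4.1 can be sketched as a small program. Everything named here is an assumption made for illustration: the function `elicit_normalized_utility`, the representation of the ranking as a list of indifference classes (best first), the callback that stands in for asking the DM a question, and the two lotteries used at the end to show a prediction.

```python
def elicit_normalized_utility(ranking, indifference_probability):
    """Build the normalized utility function from a ranking and a few answers.

    ranking: list of indifference classes (lists of outcomes), best class first.
    indifference_probability: callable mapping an outcome o to the p that makes
        the DM indifferent between o for certain and the lottery giving a best
        outcome with probability p and a worst outcome with probability 1 - p.
        It is a hypothetical stand-in for questioning the DM.
    """
    if len(ranking) < 2:
        raise ValueError("the DM must not be indifferent among all outcomes")
    utility = {}
    for outcome in ranking[0]:
        utility[outcome] = 1.0          # best outcome(s)
    for outcome in ranking[-1]:
        utility[outcome] = 0.0          # worst outcome(s)
    for tier in ranking[1:-1]:
        p = indifference_probability(tier[0])  # one question per class
        if not 0.0 <= p <= 1.0:
            raise ValueError("a probability must lie in [0, 1]")
        for outcome in tier:
            utility[outcome] = p
    return utility


def expected_utility(lottery, utility):
    """lottery: dict {outcome: probability}."""
    return sum(p * utility[o] for o, p in lottery.items())


# Example 4.1: o2 is best, o1 ~ o5 in the middle, o3 ~ o4 worst; answer 0.4.
ranking = [["o2"], ["o1", "o5"], ["o3", "o4"]]
answers = {"o1": 0.4}
U = elicit_normalized_utility(ranking, answers.__getitem__)

# Two hypothetical lotteries, used only to illustrate a prediction.
lottery_1 = {"o1": 0.5, "o3": 0.5}
lottery_2 = {"o2": 0.25, "o4": 0.75}
prediction = max(("lottery_1", lottery_1), ("lottery_2", lottery_2),
                 key=lambda item: expected_utility(item[1], U))[0]
print(U, prediction)
```

Only one question beyond the ranking is needed here because the five outcomes fall into three indifference classes, which is consistent with the bound of at most m - 1 questions.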


Test  your  understanding  of  the  concepts  introduced  in  this  section,  by  going 
through  the  exercises  in  Section  4.E.2  of  Appendix  4.E  at  the  end  of  this  chapter. 
4.3 Expected utility: the axioms

We can now turn to the list of "rationality" axioms proposed by von Neumann and Morgenstern. Let \(O = \{o_1, o_2, \dots, o_m\}\) be the set of basic outcomes and \(\mathcal{L}(O)\) the set of simple lotteries, that is, the set of probability distributions over \(O\). Let \(\succsim\) be a binary relation on \(\mathcal{L}(O)\). We say that \(\succsim\) is a von Neumann-Morgenstern ranking of \(\mathcal{L}(O)\) if it satisfies the following four axioms or properties.

Axiom 1 [Completeness and transitivity]. \(\succsim\) is complete and transitive.

As noted in the previous section, Axiom 1 implies that there is a complete and transitive ranking of the basic outcomes. Recall that \(o_{best}\) denotes a best basic outcome and \(o_{worst}\) denotes a worst basic outcome and that we are assuming that \(o_{best} \succ o_{worst}\), that is, that the DM is not indifferent among all the basic outcomes.


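Axiom 2 [Monotonicity].

\[
\begin{pmatrix} o_{best} & o_{worst} \\ p & 1 - p \end{pmatrix}
\succ
\begin{pmatrix} o_{best} & o_{worst} \\ q & 1 - q \end{pmatrix}
\quad\text{if and only if } p > q.
\]

Axiom 3 [Continuity]. For every basic outcome \(o_i\) there is a \(p_i \in [0, 1]\) such that

\[
o_i \sim \begin{pmatrix} o_{best} & o_{worst} \\ p_i & 1 - p_i \end{pmatrix}.
\]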


Before  we  introduce  the  last  axiom  we  need  one  more  definition. 


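Definition 4.4. A compound lottery is a lottery of the form

\[
\begin{pmatrix} x_1 & x_2 & \dots & x_r \\ p_1 & p_2 & \dots & p_r \end{pmatrix}
\]

where each \(x_i\) is either an element of \(O\) or an element of \(\mathcal{L}(O)\). Given a compound lottery \(C = \begin{pmatrix} x_1 & x_2 & \dots & x_r \\ p_1 & p_2 & \dots & p_r \end{pmatrix}\), the corresponding simple lottery

\[
L(C) = \begin{pmatrix} o_1 & o_2 & \dots & o_m \\ q_1 & q_2 & \dots & q_m \end{pmatrix}
\]

is defined as follows. First of all, for \(i = 1, \dots, m\) and \(j = 1, \dots, r\), define

\[
o_i(x_j) =
\begin{cases}
1 & \text{if } x_j = o_i \\
0 & \text{if } x_j = o_k \text{ with } k \neq i \\
s_i & \text{if } x_j = \begin{pmatrix} o_1 & \dots & o_{i-1} & o_i & o_{i+1} & \dots & o_m \\ s_1 & \dots & s_{i-1} & s_i & s_{i+1} & \dots & s_m \end{pmatrix}
\end{cases}
\]

Then \(q_i = \sum_{j=1}^{r} p_j \, o_i(x_j)\).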

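For example, let \(m = 4\). Then \(L = \begin{pmatrix} o_1 & o_2 & o_3 & o_4 \\ \frac{2}{5} & 0 & \frac{1}{5} & \frac{2}{5} \end{pmatrix}\) is a simple lottery (an element of \(\mathcal{L}(O)\)), while

\[
C = \begin{pmatrix}
\begin{pmatrix} o_1 & o_2 & o_3 & o_4 \\ \frac{1}{3} & \frac{1}{6} & \frac{1}{3} & \frac{1}{6} \end{pmatrix}
& o_1 &
\begin{pmatrix} o_1 & o_2 & o_3 & o_4 \\ \frac{1}{5} & 0 & \frac{1}{5} & \frac{3}{5} \end{pmatrix}
\\[4pt]
\frac{1}{2} & \frac{1}{4} & \frac{1}{4}
\end{pmatrix}
\]

is a compound lottery (with \(r = 3\)). The compound lottery \(C\) can be viewed graphically as a tree, as shown in Figure 4.1.

Then \(o_1(x_1) = \frac{1}{3}\), \(o_1(x_2) = 1\) and \(o_1(x_3) = \frac{1}{5}\) so that \(q_1 = \frac{1}{2}\left(\frac{1}{3}\right) + \frac{1}{4}(1) + \frac{1}{4}\left(\frac{1}{5}\right) = \frac{28}{60}\). Similarly, \(q_2 = \frac{1}{2}\left(\frac{1}{6}\right) + \frac{1}{4}(0) + \frac{1}{4}(0) = \frac{1}{12} = \frac{5}{60}\), \(q_3 = \frac{1}{2}\left(\frac{1}{3}\right) + \frac{1}{4}(0) + \frac{1}{4}\left(\frac{1}{5}\right) = \frac{13}{60}\) and \(q_4 = \frac{1}{2}\left(\frac{1}{6}\right) + \frac{1}{4}(0) + \frac{1}{4}\left(\frac{3}{5}\right) = \frac{14}{60}\). These numbers correspond to multiplying the probabilities along the edges of the tree of Figure 4.1 leading to an outcome, as shown in Figure 4.2 (a) and then adding up the probabilities of each outcome, as shown in Figure 4.2 (b). Thus the simple lottery \(L(C)\) that corresponds to \(C\) is

\[
L(C) = \begin{pmatrix} o_1 & o_2 & o_3 & o_4 \\ \frac{28}{60} & \frac{5}{60} & \frac{13}{60} & \frac{14}{60} \end{pmatrix},
\]

namely the lottery shown in Figure 4.2 (b).

Figure 4.2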
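
The reduction of a compound lottery to its corresponding simple lottery, q_i = sum over j of p_j times o_i(x_j), is mechanical and can be sketched in Python. The function name `reduce_compound` and the data layout (a list of pairs, where each first entry is either an outcome name or a dictionary describing a simple lottery) are assumptions made for illustration. The inputs are those of the example with m = 4 and r = 3: x1 = (1/3, 1/6, 1/3, 1/6), x2 = o1, x3 = (1/5, 0, 1/5, 3/5), with probabilities 1/2, 1/4, 1/4.

```python
from fractions import Fraction as Fr


def reduce_compound(compound, outcomes):
    """Return the simple lottery corresponding to a compound lottery.

    compound: list of (x_j, p_j) pairs; x_j is either a basic outcome (str)
        or a simple lottery given as a dict {outcome: probability}.
    outcomes: list of all basic outcomes.
    """
    q = {o: Fr(0) for o in outcomes}
    for x, p in compound:
        if isinstance(x, dict):
            # x_j is itself a lottery: o_i(x_j) is the probability s_i it gives o_i.
            for o, s in x.items():
                q[o] += p * s
        else:
            # x_j is a basic outcome: o_i(x_j) is 1 for that outcome, 0 otherwise.
            q[x] += p
    return q


outcomes = ["o1", "o2", "o3", "o4"]
x1 = {"o1": Fr(1, 3), "o2": Fr(1, 6), "o3": Fr(1, 3), "o4": Fr(1, 6)}
x3 = {"o1": Fr(1, 5), "o2": Fr(0), "o3": Fr(1, 5), "o4": Fr(3, 5)}
C = [(x1, Fr(1, 2)), ("o1", Fr(1, 4)), (x3, Fr(1, 4))]

L_C = reduce_compound(C, outcomes)
print(L_C)
```

Using exact fractions makes it easy to confirm that the resulting probabilities add up to 1, a quick consistency check on any hand computation of this kind.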


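Axiom 4 [Independence or substitutability]. Consider an arbitrary basic outcome \(o_i\) and an arbitrary simple lottery

\[
L = \begin{pmatrix} o_1 & \dots & o_{i-1} & o_i & o_{i+1} & \dots & o_m \\ p_1 & \dots & p_{i-1} & p_i & p_{i+1} & \dots & p_m \end{pmatrix}.
\]

If \(\hat{L}\) is a simple lottery such that \(o_i \sim \hat{L}\) then \(L \sim M\) where \(M\) is the simple lottery corresponding to the compound lottery

\[
C = \begin{pmatrix} o_1 & \dots & o_{i-1} & \hat{L} & o_{i+1} & \dots & o_m \\ p_1 & \dots & p_{i-1} & p_i & p_{i+1} & \dots & p_m \end{pmatrix}
\]

(Definition 4.4) obtained by replacing \(o_i\) with \(\hat{L}\) in \(L\).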
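We can now prove the first theorem of the previous section.

Proof of Theorem 4.1. To simplify the notation, throughout this proof we will assume that we have renumbered the basic outcomes in such a way that \(o_{best} = o_1\) and \(o_{worst} = o_m\).

First of all, for every basic outcome \(o_i\), let \(u_i \in [0, 1]\) be such that

\[
o_i \sim \begin{pmatrix} o_1 & o_m \\ u_i & 1 - u_i \end{pmatrix}.
\]

The existence of such values \(u_i\) is guaranteed by the Continuity Axiom (Axiom 3); clearly \(u_1 = 1\) and \(u_m = 0\). Now consider an arbitrary lottery

\[
L_1 = \begin{pmatrix} o_1 & \dots & o_m \\ p_1 & \dots & p_m \end{pmatrix}.
\]

First we show that

\[
L_1 \sim \begin{pmatrix} o_1 & o_m \\ \sum_{i=1}^{m} p_i u_i & 1 - \sum_{i=1}^{m} p_i u_i \end{pmatrix} \qquad (\dagger)
\]

This is done through a repeated application of the Independence Axiom (Axiom 4), as follows: consider the compound lottery

\[
C_2 = \begin{pmatrix} o_1 & \begin{pmatrix} o_1 & o_m \\ u_2 & 1 - u_2 \end{pmatrix} & o_3 & \dots & o_m \\ p_1 & p_2 & p_3 & \dots & p_m \end{pmatrix}
\]

obtained by replacing \(o_2\) in lottery \(L_1\) with the lottery \(\begin{pmatrix} o_1 & o_m \\ u_2 & 1 - u_2 \end{pmatrix}\) that the DM considers to be just as good as \(o_2\). The simple lottery corresponding to \(C_2\) is

\[
L_2 = \begin{pmatrix} o_1 & o_3 & \dots & o_{m-1} & o_m \\ p_1 + p_2 u_2 & p_3 & \dots & p_{m-1} & p_m + p_2(1 - u_2) \end{pmatrix}.
\]

Note that \(o_2\) is assigned probability 0 in \(L_2\) and thus we have omitted it. By Axiom 4, \(L_1 \sim L_2\). Now apply the same argument to \(L_2\): let

\[
C_3 = \begin{pmatrix} o_1 & \begin{pmatrix} o_1 & o_m \\ u_3 & 1 - u_3 \end{pmatrix} & \dots & o_m \\ p_1 + p_2 u_2 & p_3 & \dots & p_m + p_2(1 - u_2) \end{pmatrix}
\]

whose corresponding simple lottery is

\[
L_3 = \begin{pmatrix} o_1 & \dots & o_m \\ p_1 + p_2 u_2 + p_3 u_3 & \dots & p_m + p_2(1 - u_2) + p_3(1 - u_3) \end{pmatrix}.
\]

Note, again, that \(o_3\) is assigned probability zero in \(L_3\). By Axiom 4, \(L_2 \sim L_3\); thus, by transitivity (since \(L_1 \sim L_2\) and \(L_2 \sim L_3\)) we have that \(L_1 \sim L_3\). Repeating this argument we get that \(L_1 \sim L_{m-1}\), where

\[
L_{m-1} = \begin{pmatrix} o_1 & o_m \\ p_1 + p_2 u_2 + \dots + p_{m-1} u_{m-1} & p_m + p_2(1 - u_2) + \dots + p_{m-1}(1 - u_{m-1}) \end{pmatrix}.
\]

Since \(u_1 = 1\) and \(u_m = 0\),

\[
p_1 + p_2 u_2 + \dots + p_{m-1} u_{m-1} = p_1 u_1 + p_2 u_2 + \dots + p_{m-1} u_{m-1} + p_m u_m = \sum_{i=1}^{m} p_i u_i
\]

and

\[
\begin{aligned}
p_2(1 - u_2) + \dots + p_{m-1}(1 - u_{m-1}) + p_m
&= \sum_{i=2}^{m} p_i - \sum_{i=2}^{m-1} p_i u_i \\
&= p_1 + \sum_{i=2}^{m} p_i - \sum_{i=2}^{m-1} p_i u_i - p_1 \\
&= \sum_{i=1}^{m} p_i - \sum_{i=2}^{m-1} p_i u_i - p_1 u_1 - p_m u_m && \text{(since } u_1 = 1 \text{ and } u_m = 0\text{)} \\
&= 1 - \sum_{i=1}^{m} p_i u_i && \text{(since } \textstyle\sum_{i=1}^{m} p_i = 1\text{)}.
\end{aligned}
\]

Thus \(L_{m-1} = \begin{pmatrix} o_1 & o_m \\ \sum_{i=1}^{m} p_i u_i & 1 - \sum_{i=1}^{m} p_i u_i \end{pmatrix}\), which proves \((\dagger)\).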

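Now define the following utility function \(U : \{o_1, \dots, o_m\} \to [0, 1]\): \(U(o_i) = u_i\), where, as before, for every basic outcome \(o_i\), \(u_i \in [0, 1]\) is such that \(o_i \sim \begin{pmatrix} o_1 & o_m \\ u_i & 1 - u_i \end{pmatrix}\). Consider two arbitrary lotteries

\[
L = \begin{pmatrix} o_1 & \dots & o_m \\ p_1 & \dots & p_m \end{pmatrix}
\quad\text{and}\quad
L' = \begin{pmatrix} o_1 & \dots & o_m \\ q_1 & \dots & q_m \end{pmatrix}.
\]

We want to show that \(L \succsim L'\) if and only if \(E[U(L)] \geq E[U(L')]\), that is, if and only if \(\sum_{i=1}^{m} p_i u_i \geq \sum_{i=1}^{m} q_i u_i\). By \((\dagger)\), \(L \sim M\), where

\[
M = \begin{pmatrix} o_1 & o_m \\ \sum_{i=1}^{m} p_i u_i & 1 - \sum_{i=1}^{m} p_i u_i \end{pmatrix}
\]

and also \(L' \sim M'\) where

\[
M' = \begin{pmatrix} o_1 & o_m \\ \sum_{i=1}^{m} q_i u_i & 1 - \sum_{i=1}^{m} q_i u_i \end{pmatrix}.
\]

Thus, by transitivity of \(\succsim\), \(L \succsim L'\) if and only if \(M \succsim M'\); by the Monotonicity Axiom (Axiom 2), \(M \succsim M'\) if and only if \(\sum_{i=1}^{m} p_i u_i \geq \sum_{i=1}^{m} q_i u_i\). ■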

The following example, known as the Allais paradox, suggests that one should view expected utility theory as a prescriptive or normative theory (that is, as a theory about how rational people should choose) rather than as a descriptive theory (that is, as a theory about the actual behavior of individuals). In 1953 the French economist Maurice Allais published a paper regarding a survey he had conducted in 1952 concerning a hypothetical decision problem. Subjects "with good training in and knowledge of the theory of probability, so that they could be considered to behave rationally" were asked to rank the following pairs of lotteries:

\[
A = \begin{pmatrix} \$5 \text{ million} & \$0 \\ \frac{89}{100} & \frac{11}{100} \end{pmatrix}
\quad\text{versus}\quad
B = \begin{pmatrix} \$1 \text{ million} & \$0 \\ \frac{90}{100} & \frac{10}{100} \end{pmatrix}
\]

and

\[
C = \begin{pmatrix} \$5 \text{ million} & \$1 \text{ million} & \$0 \\ \frac{89}{100} & \frac{10}{100} & \frac{1}{100} \end{pmatrix}
\quad\text{versus}\quad
D = \begin{pmatrix} \$1 \text{ million} \\ 1 \end{pmatrix}.
\]

Most subjects reported the following ranking:

\[
A \succ B \quad\text{and}\quad D \succ C.
\]


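Such a ranking violates the axioms of expected utility. To see this, let \(O = \{o_1, o_2, o_3\}\) with \(o_1 = \$5\) million, \(o_2 = \$1\) million and \(o_3 = \$0\). Let us assume that the individual in question prefers more money to less: \(o_1 \succ o_2 \succ o_3\) and has a von Neumann-Morgenstern ranking of the lotteries over \(O\). Let \(u_2 \in (0, 1)\) be such that

\[
D \sim \begin{pmatrix} \$5 \text{ million} & \$0 \\ u_2 & 1 - u_2 \end{pmatrix}
\]

(the existence of such \(u_2\) is guaranteed by the Continuity Axiom). Then, since \(D \succ C\), by transitivity

\[
\begin{pmatrix} \$5 \text{ million} & \$0 \\ u_2 & 1 - u_2 \end{pmatrix} \succ C \qquad (*)
\]

Let \(C'\) be the simple lottery corresponding to the compound lottery

\[
\begin{pmatrix} \$5 \text{ million} & \begin{pmatrix} \$5 \text{ million} & \$0 \\ u_2 & 1 - u_2 \end{pmatrix} & \$0 \\[4pt] \frac{89}{100} & \frac{10}{100} & \frac{1}{100} \end{pmatrix}.
\]

Then

\[
C' = \begin{pmatrix} \$5 \text{ million} & \$0 \\ \frac{89}{100} + \frac{10}{100} u_2 & 1 - \left( \frac{89}{100} + \frac{10}{100} u_2 \right) \end{pmatrix}.
\]

By the Independence Axiom, \(C \sim C'\) and thus, by \((*)\) and transitivity,

\[
\begin{pmatrix} \$5 \text{ million} & \$0 \\ u_2 & 1 - u_2 \end{pmatrix}
\succ
\begin{pmatrix} \$5 \text{ million} & \$0 \\ \frac{89}{100} + \frac{10}{100} u_2 & 1 - \left( \frac{89}{100} + \frac{10}{100} u_2 \right) \end{pmatrix}.
\]

Hence, by the Monotonicity Axiom, \(u_2 > \frac{89}{100} + \frac{10}{100} u_2\), that is,

\[
u_2 > \frac{89}{90}. \qquad (\blacklozenge)
\]

Let \(B'\) be the simple lottery corresponding to the compound lottery

\[
\begin{pmatrix} \begin{pmatrix} \$5 \text{ million} & \$0 \\ u_2 & 1 - u_2 \end{pmatrix} & \$0 \\[4pt] \frac{90}{100} & \frac{10}{100} \end{pmatrix}.
\]

Then

\[
B' = \begin{pmatrix} \$5 \text{ million} & \$0 \\ \frac{90}{100} u_2 & 1 - \frac{90}{100} u_2 \end{pmatrix}.
\]

By the Independence Axiom, \(B \sim B'\); thus, since \(A \succ B\), by transitivity \(A \succ B'\) and therefore, by the Monotonicity Axiom, \(\frac{89}{100} > \frac{90}{100} u_2\), that is, \(u_2 < \frac{89}{90}\), contradicting \((\blacklozenge)\). Thus, if one finds the expected utility axioms compelling as axioms of rationality, then one cannot consistently express a preference for A over B and also a preference for D over C.

3 See http://en.wikipedia.org/wiki/Allais_paradox.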
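
The contradiction can also be seen by computing expected utilities directly. The sketch below is an illustration, not part of the original argument: it assumes the normalized utilities U($5 million) = 1, U($0) = 0 and U($1 million) = u2, uses the lotteries A = (89/100 on $5 million), B = (90/100 on $1 million), C = (89/100 on $5 million, 10/100 on $1 million) and D = $1 million for certain, and scans a grid of candidate values of u2 (the grid and the function name are assumptions) looking for one that ranks A above B and D above C.

```python
from fractions import Fraction as Fr


def allais_differences(u2):
    """Return (EU(A) - EU(B), EU(D) - EU(C)) for normalized utilities
    U($5 million) = 1, U($1 million) = u2, U($0) = 0."""
    eu_A = Fr(89, 100)                        # 89/100 chance of $5 million
    eu_B = Fr(90, 100) * u2                   # 90/100 chance of $1 million
    eu_C = Fr(89, 100) + Fr(10, 100) * u2     # 89/100 and 10/100 chances
    eu_D = u2                                 # $1 million for certain
    return eu_A - eu_B, eu_D - eu_C


# Candidate utilities of $1 million strictly between 0 and 1.
grid = [Fr(k, 1000) for k in range(1, 1000)]
consistent = [u2 for u2 in grid
              if all(diff > 0 for diff in allais_differences(u2))]
print(consistent)
```

The two differences are 89/100 - (90/100)u2 and (90/100)u2 - 89/100, that is, exact negatives of each other, so no value of u2 can make both strictly positive; the grid search merely makes this visible.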


Another well-known paradox is the Ellsberg paradox. Suppose you are told that an urn contains 30 red balls and 60 other balls that are either blue or yellow. You don't know how many blue or how many yellow balls there are, but the number of blue balls plus the number of yellow balls equals 60 (they could be all blue or all yellow or any combination of the two). The balls are well mixed so that each individual ball is as likely to be drawn as any other. You are given a choice between the bets A and B, where

A = you  get  $100  if  you  pick  a red  ball  and  nothing  otherwise, 

B = you  get  $100  if  you  pick  a blue  ball  and  nothing  otherwise. 


Many subjects in experiments state a strict preference for A over B: \(A \succ B\). Consider now the following bets:

C = you get $100 if you pick a red or yellow ball and nothing otherwise,

D = you get $100 if you pick a blue or yellow ball and nothing otherwise.

Do the axioms of expected utility constrain your ranking of C and D? Many subjects in experiments state the following ranking: \(A \succ B\) and \(D \succ C\). All such people violate the axioms of expected utility. The fraction of red balls in the urn is \(\frac{30}{90} = \frac{1}{3}\). Let \(p_2\) be the fraction of blue balls and \(p_3\) the fraction of yellow balls (either of these can be zero: all we know is that \(p_2 + p_3 = \frac{60}{90} = \frac{2}{3}\)). Then A, B, C and D can be viewed as the following lotteries:

\[
A = \begin{pmatrix} \$100 & \$0 \\ \frac{1}{3} & p_2 + p_3 \end{pmatrix},
\quad
B = \begin{pmatrix} \$100 & \$0 \\ p_2 & \frac{1}{3} + p_3 \end{pmatrix},
\quad
C = \begin{pmatrix} \$100 & \$0 \\ \frac{1}{3} + p_3 & p_2 \end{pmatrix},
\quad
D = \begin{pmatrix} \$100 & \$0 \\ p_2 + p_3 = \frac{2}{3} & \frac{1}{3} \end{pmatrix}.
\]

Let \(U\) be the normalized von Neumann-Morgenstern utility function that represents the individual's ranking; then \(U(\$100) = 1\) and \(U(\$0) = 0\). Thus \(E[U(A)] = \frac{1}{3}\), \(E[U(B)] = p_2\), \(E[U(C)] = \frac{1}{3} + p_3\) and \(E[U(D)] = p_2 + p_3 = \frac{2}{3}\). Hence, \(A \succ B\) if and only if \(\frac{1}{3} > p_2\), which implies that \(p_3 > \frac{1}{3}\), so that \(E[U(C)] = \frac{1}{3} + p_3 > E[U(D)] = \frac{2}{3}\) and thus \(C \succ D\) (similarly, \(B \succ A\) if and only if \(\frac{1}{3} < p_2\), which implies that \(E[U(C)] < E[U(D)]\) and thus \(D \succ C\)).
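
The same conclusion can be checked by enumeration. The sketch below is illustrative only: it assumes the normalized utilities U($100) = 1 and U($0) = 0, treats the number of blue balls as a parameter ranging over every possibility from 0 to 60, and uses a hypothetical function name. For each composition of the urn it computes the four expected utilities and records any composition under which A is ranked above B and, at the same time, D above C.

```python
from fractions import Fraction as Fr


def ellsberg_expected_utilities(blue):
    """Expected utilities of bets A, B, C, D when the urn holds 30 red balls,
    `blue` blue balls and 60 - blue yellow balls (90 balls in total)."""
    if not 0 <= blue <= 60:
        raise ValueError("the number of blue balls must be between 0 and 60")
    p_red = Fr(30, 90)
    p_blue = Fr(blue, 90)
    p_yellow = Fr(60 - blue, 90)
    return {
        "A": p_red,               # $100 if red
        "B": p_blue,              # $100 if blue
        "C": p_red + p_yellow,    # $100 if red or yellow
        "D": p_blue + p_yellow,   # $100 if blue or yellow
    }


violations = []
for blue in range(61):
    eu = ellsberg_expected_utilities(blue)
    if eu["A"] > eu["B"] and eu["D"] > eu["C"]:
        violations.append(blue)
print(violations)
```

No composition of the urn supports the commonly stated ranking, which is the content of the argument above: whatever the DM believes about the number of blue balls, ranking A above B commits her to ranking C above D.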


Test  your  understanding  of  the  concepts  introduced  in  this  section,  by  going 
through  the  exercises  in  Section  4.E.3  of  Appendix  4.E  at  the  end  of  this  chapter. 
Appendix 4.E: Exercises

4.E.1. Exercises for Section 4.1: Money lotteries and attitudes to risk.

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 

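Exercise 4.1. What is the expected value of the lottery with money prizes 24, 12, 48 and 6? [The probability row of this lottery is illegible in the source.]

Exercise 4.2. Consider the following lottery:

\[
\begin{pmatrix} o_1 & o_2 & o_3 \\ \frac{1}{4} & \frac{1}{2} & \frac{1}{4} \end{pmatrix},
\]

where

\(o_1\) = you get an invitation to have dinner at the White House

\(o_2\) = you get (for free) a puppy of your choice

\(o_3\) = you get $600.

What is the expected value of this lottery?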


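Exercise 4.3. Consider the following money lottery

\[
\begin{pmatrix} \$10 & \$15 & \$18 & \$20 & \$25 & \$30 & \$36 \\ \frac{3}{12} & \frac{1}{12} & 0 & \frac{3}{12} & \frac{2}{12} & 0 & \frac{3}{12} \end{pmatrix}
\]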

(a)  What  is  the  expected  value  of  the  lottery? 

(b)  Ann  says  that,  between  getting  $20  for  certain  and  playing  the  above 
lottery,  she  would  prefer  the  $20  for  certain.  What  is  her  attitude  to  risk? 

(c)  Bob  prefers  more  money  to  less  and  has  transitive  preferences.  He  says 
that,  given  the  same  choice  as  Ann,  he  would  prefer  playing  the  lottery. 
What  is  his  attitude  to  risk? 


Exercise  4.4.  Sam  has  a debilitating  illness  and  has  been  offered  two  mutually 
exclusive  courses  of  action:  (1)  take  some  well-known  drugs  which  have  been 
tested  for  a long  time  and  (2)  take  a new  experimental  drug.  If  he  chooses  (1) 
then  for  certain  his  pain  will  be  reduced  to  a bearable  level.  If  he  chooses  (2) 
then  he  has  a 50%  chance  of  being  completely  cured  and  a 50%  chance  of  no 
benefits  from  the  drug  and  possibly  some  harmful  side  effects.  He  chose  (1). 
What  is  his  attitude  to  risk? 
4.E.2. Exercises for Section 4.2: Expected utility theory.

Exercise 4.5. Ben is offered a choice between the following two money lotteries:

\[
A = \begin{pmatrix} \$4{,}000 & \$0 \\ 0.8 & 0.2 \end{pmatrix}
\quad\text{and}\quad
B = \begin{pmatrix} \$3{,}000 \\ 1 \end{pmatrix}.
\]

He says he strictly prefers B to A. Which of the following two lotteries, C and D, will Ben choose if he satisfies the axioms of expected utility and prefers more money to less?

\[
C = \begin{pmatrix} \$4{,}000 & \$0 \\ 0.2 & 0.8 \end{pmatrix}
\qquad
D = \begin{pmatrix} \$3{,}000 & \$0 \\ 0.25 & 0.75 \end{pmatrix}
\]

Exercise 4.6. There are three basic outcomes, \(o_1\), \(o_2\) and \(o_3\). Ann satisfies the axioms of expected utility and her preferences over lotteries involving these three outcomes can be represented by the following von Neumann-Morgenstern utility function: \(V(o_2) = a > V(o_1) = b > V(o_3) = c\). Normalize the utility function.


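Exercise 4.7. Consider the following lotteries:

\[
L_1 = \begin{pmatrix} \$3000 & \$2000 & \$1000 & \$500 \\ \frac{5}{6} & 0 & 0 & \frac{1}{6} \end{pmatrix},
\quad
L_2 = \begin{pmatrix} \$3000 & \$2000 & \$1000 & \$500 \\ \frac{2}{3} & 0 & 0 & \frac{1}{3} \end{pmatrix},
\]

\[
L_3 = \begin{pmatrix} \$3000 & \$2000 & \$1000 & \$500 \\ \frac{1}{4} & \frac{1}{4} & \frac{1}{4} & \frac{1}{4} \end{pmatrix},
\quad
L_4 = \begin{pmatrix} \$3000 & \$2000 & \$1000 & \$500 \\ 0 & \frac{1}{2} & \frac{1}{2} & 0 \end{pmatrix}.
\]

Jennifer says that she is indifferent between lottery \(L_1\) and getting $2,000 for certain. She is also indifferent between lottery \(L_2\) and getting $1,000 for certain. Finally, she says that between \(L_3\) and \(L_4\) she would choose \(L_3\). Is she rational according to the theory of expected utility? [Assume that she prefers more money to less.]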


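Exercise 4.8. Consider the following basic outcomes: \(o_1\) = a Summer internship at the White House, \(o_2\) = a free 1-week vacation in Europe, \(o_3\) = $800, \(o_4\) = a free ticket to a concert. Rachel says that her ranking of these outcomes is \(o_1 \succ o_2 \succ o_3 \succ o_4\). She also says that (1) she is indifferent between \(\begin{pmatrix} o_2 \\ 1 \end{pmatrix}\) and \(\begin{pmatrix} o_1 & o_4 \\ \frac{4}{5} & \frac{1}{5} \end{pmatrix}\) and (2) she is indifferent between \(\begin{pmatrix} o_3 \\ 1 \end{pmatrix}\) and \(\begin{pmatrix} o_1 & o_4 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}\). If she satisfies the axioms of expected utility theory, which of the two lotteries

\[
L_1 = \begin{pmatrix} o_1 & o_2 & o_3 & o_4 \\ \frac{1}{8} & \frac{2}{8} & \frac{3}{8} & \frac{2}{8} \end{pmatrix}
\quad\text{and}\quad
L_2 = \begin{pmatrix} o_1 & o_2 & o_3 \\ \frac{1}{5} & \frac{3}{5} & \frac{1}{5} \end{pmatrix}
\]

will she choose?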
Exercise 4.9. Consider the following lotteries:

\[
L_1 = \begin{pmatrix} \$30 & \$28 & \$24 & \$18 & \$8 \\ \frac{2}{10} & \frac{1}{10} & \frac{1}{10} & \frac{2}{10} & \frac{4}{10} \end{pmatrix}
\quad\text{and}\quad
L_2 = \begin{pmatrix} \$30 & \$28 & \$8 \\ \frac{1}{10} & \frac{4}{10} & \frac{5}{10} \end{pmatrix}.
\]

(a) Which lottery would a risk neutral person choose?

(b) Paul's von Neumann-Morgenstern utility-of-money function is \(U(m) = \ln(m)\), where \(\ln\) denotes the natural logarithm. Which lottery would Paul choose?

Exercise 4.10. There are five basic outcomes. Jane has a von Neumann-Morgenstern ranking of the set of lotteries over the set of basic outcomes that can be represented by either of the utility functions \(U\) or \(V\) given below:

outcome    o1     o2     o3     o4     o5
U          44    170    -10     26     98
V          32     95      5     23     59

(a) Show how to normalize each of \(U\) and \(V\) and verify that you get the same normalized utility function.

(b) Show how to transform \(U\) into \(V\) with a positive affine transformation of the form \(x \mapsto ax + b\) with \(a, b \in \mathbb{R}\) and \(a > 0\).


Exercise 4.11. Consider the following lotteries:

\[
L_3 = \begin{pmatrix} \$28 \\ 1 \end{pmatrix},
\quad
L_4 = \begin{pmatrix} \$10 & \$50 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}.
\]

(a) Ann has the following von Neumann-Morgenstern utility function: \(U_{Ann}(m)\) = [formula illegible in the source]. How does she rank the two lotteries?

(b) Bob has the following von Neumann-Morgenstern utility function: \(U_{Bob}(m) = 2m - \frac{m^4}{100^3}\). How does he rank the two lotteries?

(c) Verify that both Ann and Bob are risk averse, by determining what they would choose between lottery \(L_4\) and its expected value for certain.
4.E.2. Exercises for Section 4.3: Expected utility axioms.

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 


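Exercise 4.12. Let \(O = \{o_1, o_2, o_3, o_4\}\). Find the simple lottery corresponding to the following compound lottery:

\[
\begin{pmatrix}
\begin{pmatrix} o_1 & o_2 & o_3 & o_4 \\ \frac{2}{5} & \frac{1}{10} & \frac{3}{10} & \frac{1}{5} \end{pmatrix} & o_2 &
\begin{pmatrix} o_1 & o_3 & o_4 \\ \frac{1}{5} & \frac{1}{5} & \frac{3}{5} \end{pmatrix} &
\begin{pmatrix} o_2 & o_3 \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix} \\[6pt]
\frac{1}{8} & \frac{1}{4} & \frac{1}{8} & \frac{1}{2}
\end{pmatrix}
\]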


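Exercise 4.13. Let \(O = \{o_1, o_2, o_3, o_4\}\). Suppose that the DM has a von Neumann-Morgenstern ranking of \(\mathcal{L}(O)\) and states the following indifference:

\[
o_1 \sim \begin{pmatrix} o_2 & o_4 \\ \frac{1}{4} & \frac{3}{4} \end{pmatrix}
\quad\text{and}\quad
o_2 \sim \begin{pmatrix} o_3 & o_4 \\ \frac{3}{5} & \frac{2}{5} \end{pmatrix}.
\]

Find a lottery that the DM considers just as good as \(L = \begin{pmatrix} o_1 & o_2 & o_3 & o_4 \\ \frac{1}{3} & \frac{2}{9} & \frac{1}{9} & \frac{1}{3} \end{pmatrix}\).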


Exercise  4.14:  Challenging  Question. 

Would  you  be  willing  to  pay  more  in  order  to  reduce  the  probability  of  dying 
within  the  next  hour  from  one  sixth  to  zero  or  from  four  sixths  to  three  sixths? 
Unfortunately,  this  is  not  a hypothetical  question:  you  accidentally  entered  the 
office  of  an  Italian  mad  scientist  and  have  been  overpowered  and  tied  to  a chair. 
The  mad  scientist  has  put  six  glasses  in  front  of  you,  numbered  1 to  6,  and  tells 
you  that  one  of  them  contains  a deadly  poison  and  the  other  five  a harmless 
liquid.  He  says  that  he  is  going  to  roll  a die  and  make  you  drink  from  the  glass 
whose  number  matches  the  number  that  shows  up  from  the  rolling  of  the  die. 
You  beg  to  be  exempted  and  he  asks  you  “what  is  the  largest  amount  of  money 
that  you  would  be  willing  to  pay  to  replace  the  glass  containing  the  poison 
with  one  containing  a harmless  liquid?”.  Interpret  this  question  as  “what  sum  of 
money  x makes  you  indifferent  between  (1)  leaving  the  poison  in  whichever 
glass  contains  it  and  rolling  the  die,  and  (2)  reducing  your  wealth  by  $x  and 
rolling  the  die  after  the  poison  has  been  replaced  by  a harmless  liquid”.  Your 
answer is: $X. Then he asks you “suppose that instead of one glass with poison
there  had  been  four  glasses  with  poison  (and  two  with  a harmless  liquid);  what 
is  the  largest  amount  of  money  that  you  would  be  willing  to  pay  to  replace  one 
glass  with  poison  with  a glass  containing  a harmless  liquid  (and  thus  roll  the  die 
with  3 glasses  with  poison  and  3 with  a harmless  liquid)?”.  Your  answer  is:  $Y. 
Show  that  if  X > Y then  you  do  not  satisfy  the  axioms  of  Expected  Utility 
Theory.  [Hint:  think  about  what  the  basic  outcomes  are;  assume  that  you  do 
not  care  about  how  much  money  is  left  in  your  estate  if  you  die  and  that,  when 
alive,  you  prefer  more  money  to  less.] 
Appendix 4.S: Solutions to exercises


Exercise 4.1. The expected value of the lottery \(\begin{pmatrix} 24 & 12 & 48 & 6 \\ \frac{1}{6} & \frac{2}{6} & \frac{1}{6} & \frac{2}{6} \end{pmatrix}\) is

\[
\tfrac{1}{6}(24) + \tfrac{2}{6}(12) + \tfrac{1}{6}(48) + \tfrac{2}{6}(6) = 18.
\]


Exercise  4.2.  This  was  a trick  question!  There  is  no  expected  value  because  the 
basic  outcomes  are  not  numbers. 


Exercise 4.3. (a) The expected value is

\[
E[L] = \tfrac{3}{12}(10) + \tfrac{1}{12}(15) + (0)(18) + \tfrac{3}{12}(20) + \tfrac{2}{12}(25) + (0)(30) + \tfrac{3}{12}(36) = \tfrac{263}{12} = \$21.917.
\]

(b) Presumably Ann prefers more money to less, so that she prefers $21.917 to $20. She said that she prefers $20 to the lottery L. Thus, assuming that her preferences are transitive, she prefers $21.917 to L. Hence she is risk averse.

(c) The answer is: we cannot tell. First of all, since Bob prefers more money to less, he prefers $21.917 to $20. Bob could be risk neutral, because a risk-neutral person would be indifferent between L and $21.917; since Bob prefers $21.917 to $20 and has transitive preferences, if risk neutral he would prefer L to $20 for certain. However, Bob could also be risk loving: a risk-loving person prefers L to $21.917 and we know that he prefers $21.917 to $20; thus, by transitivity, if risk loving, he would prefer L to $20 for certain. But Bob could also be risk averse: he could consistently prefer $21.917 to L and L to $20 (for example, he could consider L to be just as good as $20.50).

Exercise 4.4. Just like Exercise 4.2, this was a trick question! Here the basic outcomes are not sums of money but states of health. Since the described choice is not one between money lotteries, the definitions of risk aversion/neutrality/love are not applicable.

Exercise  4.5.  Since  Ben  prefers  B to  A,  he  must  prefer  D to  C. 

Proof:  Let  U be  a von  Neumann-Morgenstern  utility  function  that  represents 
Ben's  preferences.  Let  U($4,000)  = a,  U($3,000)  = b and  U($0)  = c.  Since  Ben 
prefers  more  money  to  less,  a > b > c.  Now, 

\[
EU(A) = 0.8\,U(\$4{,}000) + 0.2\,U(\$0) = 0.8a + 0.2c \quad\text{and}\quad EU(B) = U(\$3{,}000) = b.
\]

Since Ben prefers B to A, it must be that \(b > 0.8a + 0.2c\). Let us now compare C and D: \(EU(C) = 0.2a + 0.8c\) and \(EU(D) = 0.25b + 0.75c\). Since \(b > 0.8a + 0.2c\), \(0.25b > 0.25(0.8a + 0.2c) = 0.2a + 0.05c\) and thus, adding \(0.75c\) to both sides, we get that \(0.25b + 0.75c > 0.2a + 0.8c\), that is, \(EU(D) > EU(C)\), so that \(D \succ C\).


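Exercise 4.6. Define the function U as follows:

\[
U(x) = \frac{V(x) - c}{a - c} = \frac{1}{a - c}\,V(x) - \frac{c}{a - c}.
\]

Then U represents the same preferences as V (note that, by hypothesis, \(a > c\) and thus \(\frac{1}{a-c} > 0\)). Then

\[
U(o_1) = \frac{V(o_1) - c}{a - c} = \frac{a - c}{a - c} = 1, \quad
U(o_2) = \frac{V(o_2) - c}{a - c} = \frac{b - c}{a - c}, \quad\text{and}\quad
U(o_3) = \frac{V(o_3) - c}{a - c} = \frac{c - c}{a - c} = 0.
\]

Note that, since \(a > b > c\), \(0 < \frac{b - c}{a - c} < 1\).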


Exercise 4.7. Suppose that there is a von Neumann-Morgenstern utility function U that represents Jennifer’s preferences. We can normalize it so that \(U(\$3000) = 1\) and \(U(\$500) = 0\). Since Jennifer is indifferent between \(L_1\) and $2000, \(U(\$2000) = \frac{5}{6}\). Since she is indifferent between \(L_2\) and $1000, \(U(\$1000) = \frac{2}{3}\). Thus

\[
EU(L_3) = \tfrac{1}{4}(1) + \tfrac{1}{4}\left(\tfrac{5}{6}\right) + \tfrac{1}{4}\left(\tfrac{2}{3}\right) + \tfrac{1}{4}(0) = \tfrac{5}{8}
\quad\text{and}\quad
EU(L_4) = 0(1) + \tfrac{1}{2}\left(\tfrac{5}{6}\right) + \tfrac{1}{2}\left(\tfrac{2}{3}\right) + 0(0) = \tfrac{3}{4}.
\]

Since \(\frac{3}{4} > \frac{5}{8}\), Jennifer should prefer \(L_4\) over \(L_3\). Hence she is not rational according to the theory of expected utility.


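Exercise 4.8. Normalize her utility function so that \(U(o_1) = 1\) and \(U(o_4) = 0\). Then, since Rachel is indifferent between \(\begin{pmatrix} o_2 \\ 1 \end{pmatrix}\) and \(\begin{pmatrix} o_1 & o_4 \\ \frac{4}{5} & \frac{1}{5} \end{pmatrix}\), we have that \(U(o_2) = \frac{4}{5}\). Similarly, since she is indifferent between \(\begin{pmatrix} o_3 \\ 1 \end{pmatrix}\) and \(\begin{pmatrix} o_1 & o_4 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}\), \(U(o_3) = \frac{1}{2}\). Then the expected utility of \(L_1 = \begin{pmatrix} o_1 & o_2 & o_3 & o_4 \\ \frac{1}{8} & \frac{2}{8} & \frac{3}{8} & \frac{2}{8} \end{pmatrix}\) is

\[
\tfrac{1}{8}(1) + \tfrac{2}{8}\left(\tfrac{4}{5}\right) + \tfrac{3}{8}\left(\tfrac{1}{2}\right) + \tfrac{2}{8}(0) = \tfrac{41}{80} = 0.5125,
\]

while the expected utility of \(L_2 = \begin{pmatrix} o_1 & o_2 & o_3 \\ \frac{1}{5} & \frac{3}{5} & \frac{1}{5} \end{pmatrix}\) is

\[
\tfrac{1}{5}(1) + \tfrac{3}{5}\left(\tfrac{4}{5}\right) + \tfrac{1}{5}\left(\tfrac{1}{2}\right) = \tfrac{39}{50} = 0.78.
\]

Hence she prefers \(L_2\) to \(L_1\).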


Exercise 4.9. (a) The expected value of \(L_2\) is \(\frac{1}{10}(30) + \frac{4}{10}(28) + \frac{5}{10}(8) = 18.2\). The expected value of \(L_1\) is \(\frac{2}{10}(30) + \frac{1}{10}(28) + \frac{1}{10}(24) + \frac{2}{10}(18) + \frac{4}{10}(8) = 18\). Hence a risk-neutral person would prefer \(L_2\) to \(L_1\).

(b) The expected utility of \(L_2\) is \(\frac{1}{10}\ln(30) + \frac{2}{5}\ln(28) + \frac{1}{2}\ln(8) = 2.713\) while the expected utility of \(L_1\) is \(\frac{1}{5}\ln(30) + \frac{1}{10}\ln(28) + \frac{1}{10}\ln(24) + \frac{1}{5}\ln(18) + \frac{2}{5}\ln(8) = 2.741\). Thus Paul prefers \(L_1\) to \(L_2\).
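The arithmetic in this solution can be checked numerically. The following is a minimal sketch, assuming the lotteries of Exercise 4.9 are stored as dictionaries from prizes to probabilities; the helper names `expected_value` and `expected_utility` are illustrative choices, not notation from the text.

```python
from fractions import Fraction as F
from math import log

# Lotteries of Exercise 4.9: prize (in dollars) -> probability.
L1 = {30: F(2, 10), 28: F(1, 10), 24: F(1, 10), 18: F(2, 10), 8: F(4, 10)}
L2 = {30: F(1, 10), 28: F(4, 10), 8: F(5, 10)}

def expected_value(lottery):
    """Probability-weighted average of the prizes (exact arithmetic)."""
    return sum(p * m for m, p in lottery.items())

def expected_utility(lottery, utility):
    """Probability-weighted average of utility(prize)."""
    return sum(float(p) * utility(m) for m, p in lottery.items())

ev1, ev2 = expected_value(L1), expected_value(L2)
eu1, eu2 = expected_utility(L1, log), expected_utility(L2, log)

print(float(ev1), float(ev2))        # risk-neutral comparison
print(round(eu1, 3), round(eu2, 3))  # Paul's comparison with U(m) = ln(m)
```

A risk-neutral person compares `ev1` with `ev2`, while Paul compares `eu1` with `eu2`; the two comparisons point in opposite directions, as in the solution.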


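Exercise 4.10. (a) To normalize U first add 10 to each value and then divide by 180. Denote the normalization of U by \(\bar{U}\). Then

\[
\begin{array}{c|ccccc}
 & o_1 & o_2 & o_3 & o_4 & o_5 \\ \hline
\bar{U} & \frac{54}{180} = 0.3 & \frac{180}{180} = 1 & \frac{0}{180} = 0 & \frac{36}{180} = 0.2 & \frac{108}{180} = 0.6
\end{array}
\]

To normalize V first subtract 5 from each value and then divide by 90. Denote the normalization of V by \(\bar{V}\). Then

\[
\begin{array}{c|ccccc}
 & o_1 & o_2 & o_3 & o_4 & o_5 \\ \hline
\bar{V} & \frac{27}{90} = 0.3 & \frac{90}{90} = 1 & \frac{0}{90} = 0 & \frac{18}{90} = 0.2 & \frac{54}{90} = 0.6
\end{array}
\]

(b) The transformation is of the form \(V(o) = aU(o) + b\). To find the values of a and b plug in two sets of values and solve the system of equations

\[
\begin{cases} 44a + b = 32 \\ 170a + b = 95. \end{cases}
\]

The solution is \(a = \frac{1}{2}\), \(b = 10\). Thus \(V(o) = \frac{1}{2}U(o) + 10\).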
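As a check on both parts, the following sketch normalizes U and V and recovers the affine transformation. The function names `normalize` and `affine_map` are hypothetical helpers introduced only for this illustration.

```python
from fractions import Fraction

# Utility values of Exercise 4.10.
U = {"o1": 44, "o2": 170, "o3": -10, "o4": 26, "o5": 98}
V = {"o1": 32, "o2": 95, "o3": 5, "o4": 23, "o5": 59}

def normalize(u):
    """Rescale so that the worst outcome gets 0 and the best gets 1."""
    lo, hi = min(u.values()), max(u.values())
    return {o: Fraction(x - lo, hi - lo) for o, x in u.items()}

def affine_map(u, v, first="o1", second="o2"):
    """Solve v = a*u + b from the values of two outcomes."""
    a = Fraction(v[second] - v[first], u[second] - u[first])
    b = v[first] - a * u[first]
    return a, b

a, b = affine_map(U, V)
print(normalize(U))
print(a, b)
```

The coefficients are computed from outcomes \(o_1\) and \(o_2\) only, so it is worth confirming that the same a and b also map the remaining three values of U into those of V.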


Exercise 4.11. (a) Ann prefers \(L_3\) to \(L_4\). In fact, \(E[U_{Ann}(L_3)] = \sqrt{28} = 5.2915\) while \(E[U_{Ann}(L_4)] = \frac{1}{2}\sqrt{10} + \frac{1}{2}\sqrt{50} = 5.1167\).

(b) Bob prefers \(L_4\) to \(L_3\). In fact, \(E[U_{Bob}(L_3)] = 2(28) - \frac{28^4}{100^3} = 55.38534\) while

\[
E[U_{Bob}(L_4)] = \tfrac{1}{2}\left[2(10) - \frac{10^4}{100^3}\right] + \tfrac{1}{2}\left[2(50) - \frac{50^4}{100^3}\right] = 56.87.
\]

(c) The expected value of lottery \(L_4\) is $30; thus a risk-averse person would strictly prefer $30 with certainty to the lottery \(L_4\). We saw in part (a) that for Ann the expected utility of lottery \(L_4\) is 5.1167; the utility of $30 is \(\sqrt{30} = 5.47723\). Thus Ann would indeed choose $30 for certain over the lottery \(L_4\). We saw in part (b) that for Bob the expected utility of lottery \(L_4\) is 56.87; the utility of $30 is \(2(30) - \frac{30^4}{100^3} = 59.19\). Thus Bob would indeed choose $30 for certain over the lottery \(L_4\).
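The comparisons in parts (a) to (c) can be reproduced with a short script. This is only a sketch: the function names `u_ann`, `u_bob`, `expected_utility` and `expected_value` are illustrative, and the utility functions are the ones used in the solution above.

```python
from math import sqrt

def u_ann(m):
    return sqrt(m)

def u_bob(m):
    return 2 * m - m**4 / 100**3

# Lotteries as lists of (prize, probability) pairs.
L3 = [(28, 1.0)]
L4 = [(10, 0.5), (50, 0.5)]

def expected_utility(lottery, u):
    return sum(p * u(m) for m, p in lottery)

def expected_value(lottery):
    return sum(p * m for m, p in lottery)

for name, u in (("Ann", u_ann), ("Bob", u_bob)):
    eu3, eu4 = expected_utility(L3, u), expected_utility(L4, u)
    sure = u(expected_value(L4))  # utility of $30 for certain
    print(name, round(eu3, 4), round(eu4, 4), round(sure, 4))
```

For each person the last printed number exceeds the expected utility of \(L_4\), which is the risk-aversion check of part (c).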


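Exercise 4.12. The simple lottery is

\[
\begin{pmatrix} o_1 & o_2 & o_3 & o_4 \\ \frac{18}{240} & \frac{103}{240} & \frac{95}{240} & \frac{24}{240} \end{pmatrix}.
\]

For example, the probability of \(o_2\) is computed as follows: \(\frac{1}{8}\left(\frac{1}{10}\right) + \frac{1}{4}(1) + \frac{1}{8}(0) + \frac{1}{2}\left(\frac{1}{3}\right) = \frac{103}{240}\).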
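The reduction of a compound lottery to a simple one is mechanical, as the following sketch shows. The component lotteries and their weights are written out here as assumptions, chosen to agree with the probabilities stated in the solution above; `reduce_compound` is a hypothetical helper name.

```python
from collections import defaultdict
from fractions import Fraction as F

# Compound lottery: list of (weight, simple lottery over basic outcomes).
# Assumed components, consistent with the solution to Exercise 4.12.
compound = [
    (F(1, 8), {"o1": F(2, 5), "o2": F(1, 10), "o3": F(3, 10), "o4": F(1, 5)}),
    (F(1, 4), {"o2": F(1)}),
    (F(1, 8), {"o1": F(1, 5), "o3": F(1, 5), "o4": F(3, 5)}),
    (F(1, 2), {"o2": F(1, 3), "o3": F(2, 3)}),
]

def reduce_compound(compound):
    """Multiply probabilities along each branch and add them up by outcome."""
    simple = defaultdict(F)  # Fraction() is 0
    for weight, lottery in compound:
        for outcome, prob in lottery.items():
            simple[outcome] += weight * prob
    return dict(simple)

simple = reduce_compound(compound)
for outcome in sorted(simple):
    print(outcome, simple[outcome])  # fractions are printed in lowest terms
```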


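Exercise 4.13. Using the stated indifference, use lottery L to construct the compound lottery

\[
\begin{pmatrix}
\begin{pmatrix} o_2 & o_4 \\ \frac{1}{4} & \frac{3}{4} \end{pmatrix} &
\begin{pmatrix} o_3 & o_4 \\ \frac{3}{5} & \frac{2}{5} \end{pmatrix} & o_3 & o_4 \\[6pt]
\frac{1}{3} & \frac{2}{9} & \frac{1}{9} & \frac{1}{3}
\end{pmatrix}
\]

whose corresponding simple lottery is \(L' = \begin{pmatrix} o_1 & o_2 & o_3 & o_4 \\ 0 & \frac{1}{12} & \frac{11}{45} & \frac{121}{180} \end{pmatrix}\). Then, by the Independence Axiom, \(L \sim L'\).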


Exercise  4.14.  Let  W be  your  initial  wealth.  The  basic  outcomes  are: 

1. you do not pay any money, don’t die and live to enjoy your wealth W (denote this outcome by \(A_0\)),

2. you pay $Y, don’t die and live to enjoy your remaining wealth \(W - Y\) (call this outcome \(A_Y\)),

3. you pay $X, don’t die and live to enjoy your remaining wealth \(W - X\) (call this outcome \(A_X\)),

4. you die (call this outcome D); this could happen because (a) you do not pay any money, roll the die and drink the poison or (b) you pay $Y, roll the die and drink the poison; we assume that you are indifferent between these two outcomes.

Since, by hypothesis, \(X > Y\), your ranking of these outcomes must be \(A_0 \succ A_Y \succ A_X \succ D\). If you satisfy the von Neumann-Morgenstern axioms, then your preferences can be represented by a von Neumann-Morgenstern utility function U defined on the set of basic outcomes. We can normalize your utility function by setting \(U(A_0) = 1\) and \(U(D) = 0\). Furthermore, it must be that

\[
U(A_Y) > U(A_X). \qquad (\ast)
\]

The maximum amount p that you are willing to pay is that amount that makes you indifferent between (1) rolling the die with the initial number of poisoned glasses and (2) giving up $p and rolling the die with one less poisoned glass. Thus - based on your answers - you are indifferent between the two lotteries

\[
\begin{pmatrix} D & A_0 \\ \frac{1}{6} & \frac{5}{6} \end{pmatrix}
\quad\text{and}\quad
\begin{pmatrix} A_X \\ 1 \end{pmatrix}
\]

and you are indifferent between the two lotteries:

\[
\begin{pmatrix} D & A_0 \\ \frac{4}{6} & \frac{2}{6} \end{pmatrix}
\quad\text{and}\quad
\begin{pmatrix} D & A_Y \\ \frac{3}{6} & \frac{3}{6} \end{pmatrix}.
\]

Thus \(\frac{1}{6}U(D) + \frac{5}{6}U(A_0) = U(A_X)\), where the left-hand side equals \(\frac{1}{6}(0) + \frac{5}{6}(1) = \frac{5}{6}\), and \(\frac{4}{6}U(D) + \frac{2}{6}U(A_0) = \frac{3}{6}U(D) + \frac{3}{6}U(A_Y)\), where the left-hand side equals \(\frac{4}{6}(0) + \frac{2}{6}(1) = \frac{2}{6}\) and the right-hand side equals \(\frac{3}{6}(0) + \frac{3}{6}U(A_Y)\). Hence \(U(A_X) = \frac{5}{6}\) and \(U(A_Y) = \frac{4}{6} = \frac{2}{3}\), so that \(U(A_X) > U(A_Y)\), contradicting \((\ast)\).
Chapter 5


Mixed Strategies in Strategic-Form Games

5.1  Strategic-form  games  with  cardinal  payoffs 

At  the  end  of  Chapter  3 we  discussed  the  possibility  of  incorporating 
random  events  in  extensive-form  games  by  means  of  chance  moves. 
The  introduction  of  chance  moves  gives  rise  to  probabilistic  outcomes 
and  thus  to  the  issue  of  how  a player  might  rank  such  outcomes.  Random 
events  can  also  occur  in  strategic-form  games,  as  the  following  example 
illustrates. 

Example  5.1.  Consider  the  following,  very  simple,  first-price  auction.  Two 
players  simultaneously  submit  a bid  for  a painting.  Only  two  bids  are  possible: 
$100  and  $200.  If  one  player  bids  $200  and  the  other  $100  then  the  high  bidder 
wins  the  painting  and  has  to  pay  her  own  bid.  If  the  two  players  bid  the  same 
amount  then  a fair  coin  is  tossed  and  if  the  outcome  is  Heads  the  winner  is 
Player  1 (who  then  has  to  pay  her  own  bid)  while  if  the  outcome  is  Tails  the 
winner  is  Player  2 (who  then  has  to  pay  her  own  bid).  Define  the  following 
basic  outcomes: 


\(o_1\): Player 1 gets the painting and pays $100

\(o_2\): Player 2 gets the painting and pays $100

\(o_3\): Player 2 gets the painting and pays $200

\(o_4\): Player 1 gets the painting and pays $200

Then this situation can be represented by means of the game-frame in strategic form shown in Table 5.1.

\[
\begin{array}{c|cc}
\text{Player 1} \;\backslash\; \text{Player 2} & \text{bid } \$100 & \text{bid } \$200 \\ \hline
\text{bid } \$100 & \begin{pmatrix} o_1 & o_2 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix} & o_3 \\[8pt]
\text{bid } \$200 & o_4 & \begin{pmatrix} o_3 & o_4 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}
\end{array}
\]

Table 5.1

A game-frame in strategic form representing Example 5.1


Suppose that Player 1 ranks the basic outcomes as follows: \(o_1 \succ o_4 \succ o_2 \sim o_3\), that is, she prefers winning to not winning; conditional on winning, she prefers to pay less and, conditional on not winning, she is indifferent as to how much Player 2 pays. Suppose also that Player 1 believes that Player 2 is going to submit a bid of $100 (perhaps she has been informed of this by somebody spying on Player 2). What should we expect Player 1 to do? Knowing her ranking of the basic outcomes is of no help, because we need to know how she ranks the probabilistic outcome \(\begin{pmatrix} o_1 & o_2 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}\) relative to the basic outcome \(o_4\).

The theory of expected utility introduced in Chapter 4 provides one possible answer to the question of how players rank probabilistic outcomes. With the aid of expected utility theory we can now generalize the definition of a strategic-form game. First we generalize the notion of game-frame in strategic form (Definition 1.1, Chapter 1) by allowing probabilistic outcomes, or lotteries, to be associated with strategy profiles. In the following definition, the bulleted items coincide with the first three items of Definition 1.1 (Chapter 1); the modified item is the last one, preceded by the symbol ♦.
Definition 5.1. A game-frame in strategic form is a quadruple \(\langle I, (S_i)_{i \in I}, O, f \rangle\) where:

• \(I = \{1, \dots, n\}\) is a set of players (\(n \geq 2\)).

• For every Player \(i \in I\), \(S_i\) is the set of strategies (or choices) of Player i. As before, we denote by \(S = S_1 \times \dots \times S_n\) the set of strategy profiles.

• O is a set of basic outcomes.

♦ \(f : S \to \mathcal{L}(O)\) is a function that associates with every strategy profile s a lottery over the set of basic outcomes O (as in Chapter 4, we denote by \(\mathcal{L}(O)\) the set of lotteries, or probability distributions, over O).

If, for every \(s \in S\), \(f(s)\) is a degenerate lottery (that is, a basic outcome) then we are back to Definition 1.1 (Chapter 1).

From a game-frame one obtains a game by adding, for every player \(i \in I\), a von Neumann-Morgenstern ranking \(\succsim_i\) of the elements of \(\mathcal{L}(O)\). It is more convenient to represent such a ranking by means of a von Neumann-Morgenstern utility function \(U_i : O \to \mathbb{R}\). We denote by \(\mathbb{E}[U_i(f(s))]\) the expected utility of lottery \(f(s) \in \mathcal{L}(O)\) for Player i. The following definition mirrors Definition 1.2 of Chapter 1.

Definition 5.2. A game in strategic form with cardinal payoffs is a quintuple \(\langle I, (S_i)_{i \in I}, O, f, (\succsim_i)_{i \in I} \rangle\) where:

• \(\langle I, (S_i)_{i \in I}, O, f \rangle\) is a game-frame in strategic form (Definition 5.1) and

• for every Player \(i \in I\), \(\succsim_i\) is a von Neumann-Morgenstern ranking of the set of lotteries \(\mathcal{L}(O)\).

If we represent each ranking \(\succsim_i\) by means of a von Neumann-Morgenstern utility function \(U_i\) and define \(\pi_i : S \to \mathbb{R}\) by \(\pi_i(s) = \mathbb{E}[U_i(f(s))]\), then the triple \(\langle I, (S_i)_{i \in I}, (\pi_i)_{i \in I} \rangle\) is called a reduced-form game in strategic form with cardinal payoffs (‘reduced-form’ because some information is lost, namely the specification of the possible outcomes). The function \(\pi_i : S \to \mathbb{R}\) is called the von Neumann-Morgenstern payoff function of Player i.

For example, consider the first-price auction of Example 5.1 whose game-frame in strategic form was shown in Table 5.1. Let \(O = \{o_1, o_2, o_3, o_4\}\) and suppose that Player 1 has a von Neumann-Morgenstern ranking of \(\mathcal{L}(O)\) that is represented by the following von Neumann-Morgenstern utility function \(U_1\) (note that the implied ordinal ranking of the basic outcomes is indeed \(o_1 \succ o_4 \succ o_2 \sim o_3\)):

\[
\begin{array}{c|cccc}
\text{outcome} & o_1 & o_2 & o_3 & o_4 \\ \hline
U_1 & 4 & 1 & 1 & 2
\end{array}
\]

Then, for Player 1, the expected utility of lottery \(\begin{pmatrix} o_1 & o_2 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}\) is 2.5 and the expected utility of lottery \(\begin{pmatrix} o_3 & o_4 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}\) is 1.5. Suppose also that Player 2 has (somewhat spiteful) preferences represented by the following von Neumann-Morgenstern utility function \(U_2\):

\[
\begin{array}{c|cccc}
\text{outcome} & o_1 & o_2 & o_3 & o_4 \\ \hline
U_2 & 1 & 6 & 4 & 5
\end{array}
\]

Thus, for Player 2, the expected utility of lottery \(\begin{pmatrix} o_1 & o_2 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}\) is 3.5 and the expected utility of lottery \(\begin{pmatrix} o_3 & o_4 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}\) is 4.5. Then we can represent the game in reduced form as shown in Table 5.2.

\[
\begin{array}{c|cc}
\text{Player 1} \;\backslash\; \text{Player 2} & \$100 & \$200 \\ \hline
\$100 & 2.5,\ 3.5 & 1,\ 4 \\
\$200 & 2,\ 5 & 1.5,\ 4.5
\end{array}
\]

Table 5.2

A cardinal game in reduced form based on the game-frame of Table 5.1

The  game  of  Table  5.2  does  not  have  any  Nash  equilibria.  However,  we  will 
show  in  the  next  section  that  if  we  extend  the  notion  of  strategy,  by  allowing 
players  to  choose  randomly,  then  the  game  of  Table  5.2  does  have  a Nash 
equilibrium. 
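The claim that no cell of Table 5.2 is a Nash equilibrium can be confirmed by brute force. The sketch below checks every pure-strategy profile for profitable unilateral deviations; the function name `pure_nash_equilibria` and the dictionary layout are assumptions made for this illustration.

```python
# Payoffs of Table 5.2: (Player 1's bid, Player 2's bid) -> (payoff 1, payoff 2).
payoffs = {
    ("$100", "$100"): (2.5, 3.5), ("$100", "$200"): (1, 4),
    ("$200", "$100"): (2, 5),     ("$200", "$200"): (1.5, 4.5),
}
strategies = ["$100", "$200"]

def pure_nash_equilibria(payoffs, strategies):
    """Return the profiles at which neither player gains by deviating alone."""
    equilibria = []
    for s1 in strategies:
        for s2 in strategies:
            u1, u2 = payoffs[(s1, s2)]
            best1 = all(u1 >= payoffs[(d, s2)][0] for d in strategies)
            best2 = all(u2 >= payoffs[(s1, d)][1] for d in strategies)
            if best1 and best2:
                equilibria.append((s1, s2))
    return equilibria

print(pure_nash_equilibria(payoffs, strategies))  # empty list: no pure-strategy equilibrium
```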

Test  your  understanding  of  the  concepts  introduced  in  this  section,  by  going 
through  the  exercises  in  Section  5.E.1  of  Appendix  5.E  at  the  end  of  this  chapter. 
5.2 Mixed strategies

Definition 5.3. Consider a game in strategic form with cardinal payoffs and recall that \(S_i\) denotes the set of strategies of Player i. From now on, we shall call \(S_i\) the set of pure strategies of Player i. We assume that \(S_i\) is a finite set (for every \(i \in I\)). A mixed strategy of Player i is a probability distribution over the set of pure strategies \(S_i\). The set of mixed strategies of Player i is denoted by \(\Sigma_i\).

Remark  5.1.  Since  among  the  mixed  strategies  of  Player  i there  are  the 
degenerate  strategies  that  assign  probability  1 to  a pure  strategy,  the  set  of 
mixed  strategies  includes  the  set  of  pure  strategies  (viewed  as  degenerate 
probability  distributions). 


For example, one possible mixed strategy for Player 1 in the game of Table 5.2 is \(\begin{pmatrix} \$100 & \$200 \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix}\). The traditional interpretation of a mixed strategy is in terms of objective randomization: the player, instead of choosing a pure strategy herself, delegates the choice to a random device.\(^4\) For example, Player 1 choosing the mixed strategy \(\begin{pmatrix} \$100 & \$200 \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix}\) is interpreted as a decision to let, say, a die determine whether she will bid $100 or $200: Player 1 will roll a die and if the outcome is 1 or 2 then she will bid $100, while if the outcome is 3, 4, 5 or 6 then she will bid $200. Suppose that Player 1 chooses this mixed strategy and Player 2 chooses the mixed strategy \(\begin{pmatrix} \$100 & \$200 \\ \frac{3}{5} & \frac{2}{5} \end{pmatrix}\). Since the players rely on independent random devices, this pair of mixed strategies gives rise to the following probabilistic outcome:

\[
\begin{array}{l|cccc}
\text{strategy profile} & (\$100, \$100) & (\$100, \$200) & (\$200, \$100) & (\$200, \$200) \\ \hline
\text{outcome} & \begin{pmatrix} o_1 & o_2 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix} & o_3 & o_4 & \begin{pmatrix} o_3 & o_4 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix} \\[8pt]
\text{probability} & \frac{1}{3} \cdot \frac{3}{5} = \frac{3}{15} & \frac{1}{3} \cdot \frac{2}{5} = \frac{2}{15} & \frac{2}{3} \cdot \frac{3}{5} = \frac{6}{15} & \frac{2}{3} \cdot \frac{2}{5} = \frac{4}{15}
\end{array}
\]

\(^4\) An alternative interpretation of mixed strategies in terms of beliefs will be discussed in a later chapter.
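If the two players have von Neumann-Morgenstern preferences, then - by the Compound Lottery Axiom (Chapter 4) - they will view the above as the following lottery:

\[
\begin{pmatrix} \text{outcome} & o_1 & o_2 & o_3 & o_4 \\ \text{probability} & \frac{3}{30} & \frac{3}{30} & \frac{8}{30} & \frac{16}{30} \end{pmatrix}
\]

Using the von Neumann-Morgenstern utility functions postulated above, namely

\[
\begin{array}{c|cccc}
\text{outcome} & o_1 & o_2 & o_3 & o_4 \\ \hline
U_1 & 4 & 1 & 1 & 2 \\
U_2 & 1 & 6 & 4 & 5
\end{array}
\]

the lottery \(\begin{pmatrix} o_1 & o_2 & o_3 & o_4 \\ \frac{3}{30} & \frac{3}{30} & \frac{8}{30} & \frac{16}{30} \end{pmatrix}\) has an expected utility of \(\frac{3}{30}4 + \frac{3}{30}1 + \frac{8}{30}1 + \frac{16}{30}2 = \frac{55}{30}\) for Player 1 and \(\frac{3}{30}1 + \frac{3}{30}6 + \frac{8}{30}4 + \frac{16}{30}5 = \frac{133}{30}\) for Player 2. Thus we can define the payoffs of the two players from this mixed strategy profile by

\[
\Pi_1\left( \begin{pmatrix} \$100 & \$200 \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix}, \begin{pmatrix} \$100 & \$200 \\ \frac{3}{5} & \frac{2}{5} \end{pmatrix} \right) = \frac{55}{30}
\quad\text{and}\quad
\Pi_2\left( \begin{pmatrix} \$100 & \$200 \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix}, \begin{pmatrix} \$100 & \$200 \\ \frac{3}{5} & \frac{2}{5} \end{pmatrix} \right) = \frac{133}{30}.
\]

Note that we can calculate these payoffs in a different - but equivalent - way by using the reduced-form game of Table 5.2, as follows.

\[
\begin{array}{l|cccc}
\text{strategy profile} & (\$100, \$100) & (\$100, \$200) & (\$200, \$100) & (\$200, \$200) \\ \hline
\text{expected utilities} & (2.5,\ 3.5) & (1,\ 4) & (2,\ 5) & (1.5,\ 4.5) \\
\text{probability} & \frac{1}{3} \cdot \frac{3}{5} = \frac{3}{15} & \frac{1}{3} \cdot \frac{2}{5} = \frac{2}{15} & \frac{2}{3} \cdot \frac{3}{5} = \frac{6}{15} & \frac{2}{3} \cdot \frac{2}{5} = \frac{4}{15}
\end{array}
\]

so that the expected payoff of Player 1 is \(\frac{3}{15}(2.5) + \frac{2}{15}(1) + \frac{6}{15}(2) + \frac{4}{15}(1.5) = \frac{55}{30}\) and the expected payoff of Player 2 is \(\frac{3}{15}(3.5) + \frac{2}{15}(4) + \frac{6}{15}(5) + \frac{4}{15}(4.5) = \frac{133}{30}\).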
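The equivalence of the two calculations can be checked with exact fractions. The following sketch assumes the game-frame of Table 5.1 and the utility functions \(U_1\) and \(U_2\) given above; the variable names (`frame`, `route1`, `route2`) are illustrative only.

```python
from fractions import Fraction as F

sigma1 = {"$100": F(1, 3), "$200": F(2, 3)}  # Player 1's mixed strategy
sigma2 = {"$100": F(3, 5), "$200": F(2, 5)}  # Player 2's mixed strategy

# Game-frame of Table 5.1: pure-strategy profile -> lottery over basic outcomes.
frame = {
    ("$100", "$100"): {"o1": F(1, 2), "o2": F(1, 2)},
    ("$100", "$200"): {"o3": F(1)},
    ("$200", "$100"): {"o4": F(1)},
    ("$200", "$200"): {"o3": F(1, 2), "o4": F(1, 2)},
}
U = [{"o1": 4, "o2": 1, "o3": 1, "o4": 2},   # U_1
     {"o1": 1, "o2": 6, "o3": 4, "o4": 5}]   # U_2

# Route 1: reduce to a simple lottery over outcomes, then take expected utility.
simple = {}
for (s1, s2), lottery in frame.items():
    for outcome, prob in lottery.items():
        simple[outcome] = simple.get(outcome, F(0)) + sigma1[s1] * sigma2[s2] * prob
route1 = [sum(p * u[o] for o, p in simple.items()) for u in U]

# Route 2: weight the reduced-form payoffs of Table 5.2 by the profile probabilities.
def reduced_payoff(u, profile):
    return sum(p * u[o] for o, p in frame[profile].items())

route2 = [sum(sigma1[s1] * sigma2[s2] * reduced_payoff(u, (s1, s2))
              for (s1, s2) in frame) for u in U]

print(simple)
print(route1, route2)
```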

The above example motivates the following definition. First some notation. Let \(\sigma_i \in \Sigma_i\) be a mixed strategy of Player i; then, for every pure strategy \(s_i \in S_i\) of Player i, we denote by \(\sigma_i(s_i)\) the probability that \(\sigma_i\) assigns to \(s_i\).\(^5\) Let \(\Sigma\) be the set of mixed-strategy profiles, that is, \(\Sigma = \Sigma_1 \times \dots \times \Sigma_n\). Consider a mixed-strategy profile \(\sigma = (\sigma_1, \dots, \sigma_n) \in \Sigma\) and a pure-strategy profile \(s = (s_1, \dots, s_n) \in S\); then we denote by \(\sigma(s)\) the product of the probabilities \(\sigma_i(s_i)\), that is,\(^6\)

\[
\sigma(s) = \prod_{i=1}^{n} \sigma_i(s_i) = \sigma_1(s_1) \times \dots \times \sigma_n(s_n).
\]

\(^5\) In the above example, if \(\sigma_1 = \begin{pmatrix} \$100 & \$200 \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix}\) then \(\sigma_1(\$200) = \frac{2}{3}\).

Definition 5.4. Consider a reduced-form game in strategic form with cardinal payoffs \(G = \langle I, (S_1, \dots, S_n), (\pi_1, \dots, \pi_n) \rangle\) (Definition 5.2), where, for every Player \(i \in I\), the set of pure strategies \(S_i\) is finite. Then the mixed-strategy extension of G is the reduced-form game in strategic form \(\langle I, (\Sigma_1, \dots, \Sigma_n), (\Pi_1, \dots, \Pi_n) \rangle\) where, for every Player \(i \in I\),

• \(\Sigma_i\) is the set of mixed strategies of Player i in G (that is, \(\Sigma_i\) is the set of probability distributions over \(S_i\)).

• The payoff function \(\Pi_i : \Sigma \to \mathbb{R}\) is defined by \(\Pi_i(\sigma) = \sum_{s \in S} \sigma(s)\,\pi_i(s)\).\(^7\)
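The payoff function of the mixed-strategy extension can be sketched for any finite number of players. In the code below, `mixed_payoff` is a hypothetical helper; a mixed-strategy profile is assumed to be a list of dictionaries (pure strategy to probability) and the pure payoffs are assumed to be stored in a dictionary keyed by pure-strategy profiles. The example data are the payoffs of Table 5.2.

```python
from fractions import Fraction as F
from itertools import product
from math import prod

def mixed_payoff(i, sigma, pi):
    """Expected payoff of player i: the sum over pure profiles s of sigma(s) * pi_i(s)."""
    total = F(0)
    for s in product(*(sigma_j.keys() for sigma_j in sigma)):
        # sigma(s) is the product of the probabilities each player assigns to her part of s.
        weight = prod(sigma_j[s_j] for sigma_j, s_j in zip(sigma, s))
        total += weight * pi[s][i]
    return total

# Reduced-form game of Table 5.2 (players are indexed 0 and 1 in the code).
pi = {
    ("$100", "$100"): (F(5, 2), F(7, 2)), ("$100", "$200"): (F(1), F(4)),
    ("$200", "$100"): (F(2), F(5)),       ("$200", "$200"): (F(3, 2), F(9, 2)),
}
sigma = [{"$100": F(1, 3), "$200": F(2, 3)},
         {"$100": F(3, 5), "$200": F(2, 5)}]

print(mixed_payoff(0, sigma, pi), mixed_payoff(1, sigma, pi))
```

Enumerating all pure-strategy profiles is exponential in the number of players, which is acceptable for small examples such as this one.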


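Definition 5.5. Fix a reduced-form game in strategic form with cardinal payoffs \(G = \langle I, (S_1, \dots, S_n), (\pi_1, \dots, \pi_n) \rangle\) (Definition 5.2), where, for every \(i \in I\), the set of pure strategies \(S_i\) is finite. A Nash equilibrium in mixed-strategies of G is a Nash equilibrium of the mixed-strategy extension of G.

For example, consider the reduced-form game of Table 5.2, which is reproduced in Table 5.3 below with all the payoffs multiplied by 10 (this corresponds to representing the preferences of the players with different utility functions that are obtained from the ones used above by multiplying them by 10). Is \(\sigma = (\sigma_1, \sigma_2)\) with \(\sigma_1 = \begin{pmatrix} \$100 & \$200 \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix}\) and \(\sigma_2 = \begin{pmatrix} \$100 & \$200 \\ \frac{3}{5} & \frac{2}{5} \end{pmatrix}\) a mixed-strategy Nash equilibrium of this game?

\(^6\) In the above example, if \(\sigma = (\sigma_1, \sigma_2)\) with \(\sigma_1 = \begin{pmatrix} \$100 & \$200 \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix}\) and \(\sigma_2 = \begin{pmatrix} \$100 & \$200 \\ \frac{3}{5} & \frac{2}{5} \end{pmatrix}\) then \(\sigma_1(\$200) = \frac{2}{3}\), \(\sigma_2(\$100) = \frac{3}{5}\) and thus \(\sigma((\$200, \$100)) = \frac{2}{3} \cdot \frac{3}{5} = \frac{6}{15}\).

\(^7\) In the above example, if \(\sigma_1 = \begin{pmatrix} \$100 & \$200 \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix}\) and \(\sigma_2 = \begin{pmatrix} \$100 & \$200 \\ \frac{3}{5} & \frac{2}{5} \end{pmatrix}\) then \(\Pi_1(\sigma_1, \sigma_2) = \frac{3}{15}(2.5) + \frac{2}{15}(1) + \frac{6}{15}(2) + \frac{4}{15}(1.5) = \frac{55}{30}\).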
\[
\begin{array}{c|cc}
\text{Player 1} \;\backslash\; \text{Player 2} & \$100 & \$200 \\ \hline
\$100 & 25,\ 35 & 10,\ 40 \\
\$200 & 20,\ 50 & 15,\ 45
\end{array}
\]

Table 5.3

The game of Table 5.2 with the payoffs multiplied by 10


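The payoff of Player 1 is \(\Pi_1(\sigma_1, \sigma_2) = \frac{3}{15}(25) + \frac{2}{15}(10) + \frac{6}{15}(20) + \frac{4}{15}(15) = \frac{55}{3} = 18.33\). If Player 1 switched from \(\sigma_1 = \begin{pmatrix} \$100 & \$200 \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix}\) to \(\hat{\sigma}_1 = \begin{pmatrix} \$100 & \$200 \\ 1 & 0 \end{pmatrix}\), that is, to the pure strategy $100, then Player 1’s payoff would be larger: \(\Pi_1(\hat{\sigma}_1, \sigma_2) = \frac{3}{5}(25) + \frac{2}{5}(10) = 19\). Thus \((\sigma_1, \sigma_2)\) is not a Nash equilibrium.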


John  Nash,  who  shared  the  1994  Nobel  prize  in  economics  with  John  Harsanyi 
and  Reinhard  Selten,  proved  the  following  theorem. 

Theorem 5.1 [Nash, 1951]. Every reduced-form game in strategic form with cardinal payoffs \(\langle I, (S_1, \dots, S_n), (\pi_1, \dots, \pi_n) \rangle\) (Definition 5.2), where, for every Player \(i \in I\), the set of pure strategies \(S_i\) is finite, has at least one Nash equilibrium in mixed-strategies.


We  will  not  give  the  proof  of  this  theorem,  since  it  is  rather  complex  (it 
requires  the  use  of  fixed-point  theorems). 


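Going back to the game of Table 5.3, let us verify that, on the other hand, \(\sigma^* = (\sigma_1^*, \sigma_2^*)\) with \(\sigma_1^* = \sigma_2^* = \begin{pmatrix} \$100 & \$200 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}\) is a Nash equilibrium in mixed strategies. The payoff of Player 1 is

\[
\Pi_1(\sigma_1^*, \sigma_2^*) = \tfrac{1}{4}(25) + \tfrac{1}{4}(10) + \tfrac{1}{4}(20) + \tfrac{1}{4}(15) = \tfrac{70}{4} = 17.5.
\]

Could Player 1 obtain a larger payoff with some other mixed strategy \(\sigma_1 = \begin{pmatrix} \$100 & \$200 \\ p & 1 - p \end{pmatrix}\) for some \(p \neq \frac{1}{2}\)? Fix an arbitrary \(p \in [0, 1]\) and let us compute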


193 


GAME  THEORY  - Giacomo  Bonanno 


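Player 1’s payoff if she uses the strategy \(\sigma_1 = \begin{pmatrix} \$100 & \$200 \\ p & 1 - p \end{pmatrix}\) against Player 2’s mixed strategy \(\sigma_2^* = \begin{pmatrix} \$100 & \$200 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}\):

\[
\Pi_1(\sigma_1, \sigma_2^*) = \tfrac{1}{2}p(25) + \tfrac{1}{2}p(10) + \tfrac{1}{2}(1 - p)(20) + \tfrac{1}{2}(1 - p)(15)
= p\left(\tfrac{1}{2}25 + \tfrac{1}{2}10\right) + (1 - p)\left(\tfrac{1}{2}20 + \tfrac{1}{2}15\right) = \tfrac{35}{2} = 17.5.
\]

Thus if Player 2 uses the mixed strategy \(\sigma_2^* = \begin{pmatrix} \$100 & \$200 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}\), then Player 1 gets the same payoff no matter what mixed strategy she employs. It follows that any mixed strategy of Player 1 is a best reply to \(\sigma_2^*\); in particular, \(\sigma_1^* = \begin{pmatrix} \$100 & \$200 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}\) is a best reply to \(\sigma_2^*\). It is easy to verify that the same applies to Player 2: any mixed strategy of Player 2 is a best reply to Player 1’s mixed strategy \(\sigma_1^* = \begin{pmatrix} \$100 & \$200 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}\). Hence \(\sigma^* = (\sigma_1^*, \sigma_2^*)\) is indeed a Nash equilibrium in mixed strategies.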
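Both observations about Table 5.3 (the profitable deviation from the first profile and the indifference at the second) can be illustrated numerically. The sketch below is not a proof: it only evaluates payoffs on a grid of probabilities, and the function name `payoff` and the grid are assumptions of this example.

```python
from fractions import Fraction as F

# Payoffs of Table 5.3: (bid of Player 1, bid of Player 2) -> (payoff 1, payoff 2).
PAYOFFS = {
    (100, 100): (25, 35), (100, 200): (10, 40),
    (200, 100): (20, 50), (200, 200): (15, 45),
}

def payoff(player, p, q):
    """Expected payoff of `player` (0 or 1) when Player 1 bids $100 with
    probability p and Player 2 bids $100 with probability q."""
    prob1 = {100: p, 200: 1 - p}
    prob2 = {100: q, 200: 1 - q}
    return sum(prob1[a] * prob2[b] * PAYOFFS[(a, b)][player]
               for a in prob1 for b in prob2)

half = F(1, 2)
grid = [F(k, 10) for k in range(11)]

# Against the opponent's 50-50 mix, each player's payoff does not depend on her own mix.
player1_values = {payoff(0, p, half) for p in grid}
player2_values = {payoff(1, half, q) for q in grid}
print(player1_values, player2_values)

# At the first profile, switching to the pure strategy $100 raises Player 1's payoff.
print(payoff(0, F(1, 3), F(3, 5)), payoff(0, F(1), F(3, 5)))
```

Because the payoff is linear in a player's own mixing probability, equality at the two pure strategies already implies equality on the whole interval; the grid merely makes this visible.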


We  will  see  in  the  next  section  that  this  “indifference”  phenomenon  is  true  in 
general. 


Remark 5.2. Since, among the mixed strategies of Player i, there are the degenerate strategies that assign probability 1 to a pure strategy (Remark 5.1),
every  Nash  equilibrium  in  pure  strategies  is  also  a Nash  equilibrium  in  mixed 
strategies.  That  is,  the  set  of  mixed-strategy  Nash  equilibria  includes  the  set  of 
pure-strategy  Nash  equilibria. 

Test  your  understanding  of  the  concepts  introduced  in  this  section,  by  going 
through  the  exercises  in  Section  5.E.2  of  Appendix  5.E  at  the  end  of  this  chapter. 
5.3 Computing the mixed-strategy Nash equilibria

How  can  we  find  the  mixed-strategy  equilibria  of  a given  game?  The  first 
important  observation  is  that  if  a pure  strategy  is  strictly  dominated  by  another 
pure  strategy  then  it  cannot  be  played  with  positive  probability  at  a Nash 
equilibrium.  Thus,  for  the  purpose  of  finding  Nash  equilibria,  one  can  delete  all 
the  strictly  dominated  strategies  and  focus  on  the  resulting  game.  But  then  the 
same  reasoning  applies  to  the  resulting  game  and  one  can  delete  all  the  strictly 
dominated  strategies  in  that  game,  and  so  on.  Thus  we  have  the  following 
observation. 

Remark  5.3.  In  order  to  find  the  mixed-strategy  Nash  equilibria  of  a game  one 
can  first  apply  the  iterated  deletion  of  strictly  dominated  strategies  (IDSDS: 
Section  1.5.1,  Chapter  1)  and  then  find  the  Nash  equilibria  of  the  resulting 
game  (which  can  then  be  viewed  as  Nash  equilibria  of  the  original  game  where 
all  the  pure  strategies  that  were  deleted  are  assigned  zero  probability).  Note, 
however,  that  - as  we  will  see  in  Section  5.4  - one  can  perform  more  deletions 
than  allowed  by  the  IDSDS  procedure. 

For  example,  consider  the  game  of  Table  5.4. 

Player  2 


Table  5.4 

A reduced-form  game  with  cardinal  payoffs 

In  this  game  there  are  no  pure-strategy  Nash  equilibria;  however,  by  Nash’s 
theorem  there  will  be  at  least  one  mixed-strategy  equilibrium.  To  find  it  we  can 
first  note  that,  for  Player  1,  D is  strictly  dominated  by  B ; deleting  D we  get  a 
smaller  game  where,  for  Player  2,  G is  strictly  dominated  by  F.  Deleting  G we 
are  left  with  a smaller  game  where  A is  strictly  dominated  by  C.  Deleting  A we 
are  left  with  the  game  shown  in  Table  5.5. 
\[
\begin{array}{c|cc}
\text{Player 1} \;\backslash\; \text{Player 2} & E & F \\ \hline
B & 4,\ 0 & 2,\ 4 \\
C & 3,\ 3 & 4,\ 2
\end{array}
\]

Table 5.5

The result of applying the IDSDS procedure to the game of Table 5.4.


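We will see below that the game of Table 5.5 has a unique Nash equilibrium in mixed strategies given by $\left( \begin{pmatrix} B & C \\ \frac{1}{5} & \frac{4}{5} \end{pmatrix}, \begin{pmatrix} E & F \\ \frac{2}{3} & \frac{1}{3} \end{pmatrix} \right)$. Thus the game of Table 5.4 has a unique Nash equilibrium in mixed strategies given by $\left( \begin{pmatrix} A & B & C & D \\ 0 & \frac{1}{5} & \frac{4}{5} & 0 \end{pmatrix}, \begin{pmatrix} E & F & G \\ \frac{2}{3} & \frac{1}{3} & 0 \end{pmatrix} \right)$.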

Once we have simplified the game as suggested in Remark 5.3, in order to find the mixed-strategy Nash equilibria we can use the following result. First we recall some notation that was introduced in Chapter 1. Given a mixed-strategy profile $\sigma = (\sigma_1, \dots, \sigma_n)$ and a Player $i$, we denote by $\sigma_{-i} = (\sigma_1, \dots, \sigma_{i-1}, \sigma_{i+1}, \dots, \sigma_n)$ the profile of strategies of the players other than $i$ and use $(\sigma_i, \sigma_{-i})$ as an alternative notation for $\sigma$; furthermore, $(\tau_i, \sigma_{-i})$ denotes the result of replacing $\sigma_i$ with $\tau_i$ in $\sigma$, that is, $(\tau_i, \sigma_{-i}) = (\sigma_1, \dots, \sigma_{i-1}, \tau_i, \sigma_{i+1}, \dots, \sigma_n)$.


Theorem 5.2. Consider a reduced-form game in strategic form with cardinal payoffs. Suppose that $\sigma^* = (\sigma_1^*, \dots, \sigma_n^*)$ is a Nash equilibrium in mixed strategies. Consider an arbitrary Player $i$. Let $\pi_i^* = \Pi_i(\sigma^*)$ be the payoff of Player $i$ at this Nash equilibrium and let $s_{ij}, s_{ik} \in S_i$ be two pure strategies of Player $i$ such that $\sigma_i^*(s_{ij}) > 0$ and $\sigma_i^*(s_{ik}) > 0$, that is, $s_{ij}$ and $s_{ik}$ are two pure strategies to which the mixed strategy $\sigma_i^*$ of Player $i$ assigns positive probability. Then $\Pi_i\big((s_{ij}, \sigma_{-i}^*)\big) = \Pi_i\big((s_{ik}, \sigma_{-i}^*)\big) = \pi_i^*$. In other words, when the other players use the mixed-strategy profile $\sigma_{-i}^*$, Player $i$ gets the same payoff no matter whether she plays the mixed strategy $\sigma_i^*$ or the pure strategy $s_{ij}$ or the pure strategy $s_{ik}$.


The details of the proof of Theorem 5.2 will be omitted, but the idea is simple: if $s_{ij}$ and $s_{ik}$ are two pure strategies to which the mixed strategy $\sigma_i^*$ of Player $i$ assigns positive probability and $\Pi_i\big((s_{ij}, \sigma_{-i}^*)\big) > \Pi_i\big((s_{ik}, \sigma_{-i}^*)\big)$, then Player $i$ can increase her payoff from $\pi_i^* = \Pi_i(\sigma^*)$ to a larger number by reducing the probability of $s_{ik}$ to zero and adding that probability to $\sigma_i^*(s_{ij})$, that is, by switching from $\sigma_i^*$ to the mixed strategy $\hat{\sigma}_i$ obtained as follows: $\hat{\sigma}_i(s_{ik}) = 0$, $\hat{\sigma}_i(s_{ij}) = \sigma_i^*(s_{ij}) + \sigma_i^*(s_{ik})$ and, for every other $s_i \in S_i$, $\hat{\sigma}_i(s_i) = \sigma_i^*(s_i)$. But this would contradict the hypothesis that $\sigma^* = (\sigma_1^*, \dots, \sigma_n^*)$ is a Nash equilibrium.


Let  us  now  go  back  to  the  game  of  Table  5.5,  reproduced  below,  and  see  how 
we  can  use  Theorem  5.2  to  find  the  Nash  equilibrium  in  mixed  strategies. 


                 Player 2
                 E       F
Player 1   B   4, 0    2, 4
           C   3, 3    4, 2


We want to find values of $p$ and $q$, strictly between 0 and 1, such that $\left( \begin{pmatrix} B & C \\ p & 1-p \end{pmatrix}, \begin{pmatrix} E & F \\ q & 1-q \end{pmatrix} \right)$ is a Nash equilibrium. By Theorem 5.2, if Player 1 played the pure strategy B against $\begin{pmatrix} E & F \\ q & 1-q \end{pmatrix}$ she should get the same payoff as if she were to play the pure strategy C. The former would give her a payoff of $4q + 2(1-q)$ and the latter a payoff of $3q + 4(1-q)$. Thus we need $q$ to be such that $4q + 2(1-q) = 3q + 4(1-q)$, that is, $q = \frac{2}{3}$. When $q = \frac{2}{3}$, both B and C give Player 1 a payoff of $\frac{10}{3}$ and thus any mixture of B and C would also give the same payoff of $\frac{10}{3}$. In other words, Player 1 is indifferent among all her mixed strategies and thus any mixed strategy is a best response to $\begin{pmatrix} E & F \\ \frac{2}{3} & \frac{1}{3} \end{pmatrix}$. Similar reasoning for Player 2 reveals that, by Theorem 5.2, we need $p$ to be such that $0p + 3(1-p) = 4p + 2(1-p)$, that is, $p = \frac{1}{5}$. Against $\begin{pmatrix} B & C \\ \frac{1}{5} & \frac{4}{5} \end{pmatrix}$ any mixed strategy of Player 2 gives him the same payoff of $\frac{12}{5}$; thus any mixed strategy of Player 2 is a best reply to $\begin{pmatrix} B & C \\ \frac{1}{5} & \frac{4}{5} \end{pmatrix}$. It follows that $\left( \begin{pmatrix} B & C \\ \frac{1}{5} & \frac{4}{5} \end{pmatrix}, \begin{pmatrix} E & F \\ \frac{2}{3} & \frac{1}{3} \end{pmatrix} \right)$ is a Nash equilibrium.
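The two indifference conditions used above are linear equations in $q$ and $p$, so they can be solved exactly. The sketch below is a hedged illustration for 2x2 games only: the function name is hypothetical, payoffs are passed as nested lists with rows for Player 1's strategies and columns for Player 2's, and the function looks only for an equilibrium in which both players mix.

```python
from fractions import Fraction

def interior_equilibrium_2x2(u1, u2):
    """Return (p, q), the probabilities of the first row and first column at
    which each player makes the opponent indifferent, or None if no such
    point lies strictly between 0 and 1."""
    # Player 1 indifferent between her rows:
    #   q*u1[0][0] + (1-q)*u1[0][1] = q*u1[1][0] + (1-q)*u1[1][1]
    den_q = (u1[0][0] - u1[0][1]) - (u1[1][0] - u1[1][1])
    # Player 2 indifferent between his columns:
    #   p*u2[0][0] + (1-p)*u2[1][0] = p*u2[0][1] + (1-p)*u2[1][1]
    den_p = (u2[0][0] - u2[1][0]) - (u2[0][1] - u2[1][1])
    if den_q == 0 or den_p == 0:
        return None
    q = Fraction(u1[1][1] - u1[0][1], den_q)
    p = Fraction(u2[1][1] - u2[1][0], den_p)
    return (p, q) if 0 < p < 1 and 0 < q < 1 else None

# Table 5.5: rows B, C for Player 1; columns E, F for Player 2.
U1 = [[4, 2], [3, 4]]
U2 = [[0, 4], [3, 2]]
p, q = interior_equilibrium_2x2(U1, U2)
print(p, q)  # probability of B and probability of E
```

Because each player is indifferent between both of her pure strategies at such a point, every mixture is a best reply, which is why solving the two equations is enough in the 2x2 case.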
Remark 5.4. It follows from Theorem 5.2, and was illustrated in the above example, that at a mixed-strategy Nash equilibrium where Player i plays two or more pure strategies with positive probability, Player i does not have an incentive to use that mixed strategy: she would get the same payoff if, instead of randomizing, she played with probability 1 one of the pure strategies in the support of her mixed strategy (that is, if she increased the probability of any pure strategy from a positive number to 1). The only purpose of randomizing is to make the other player indifferent among two or more of his own pure strategies.


Theorem 5.2 provides a necessary, but not sufficient, condition for a mixed-strategy profile to be a Nash equilibrium. To see that the condition is not sufficient, consider the game of Table 5.6 and the mixed-strategy profile $\left( \begin{pmatrix} A & B & C \\ \frac{1}{2} & \frac{1}{2} & 0 \end{pmatrix}, \begin{pmatrix} D & E \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix} \right)$.

                 Player 2
                 D       E
           A   3, 0    0, 2
Player 1   B   0, 2    3, 0
           C   2, 0    2, 1

Table 5.6

A reduced-form game with cardinal payoffs


Given that Player 2 plays the mixed strategy $\begin{pmatrix} D & E \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}$, Player 1 is indifferent between the two pure strategies that are in the support of her own mixed strategy, namely A and B: the payoff from playing A is 1.5 and so is the payoff from playing B (and 1.5 is also the payoff associated with the mixed strategy under consideration). However, the profile $\left( \begin{pmatrix} A & B & C \\ \frac{1}{2} & \frac{1}{2} & 0 \end{pmatrix}, \begin{pmatrix} D & E \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix} \right)$ is not a Nash equilibrium, because Player 1 could get a payoff of 2 by switching to the pure strategy C. We know from Theorem 5.1 that this game does have a mixed-strategy Nash equilibrium. How can we find it?


The  support  of  a mixed  strategy  is  the  set  of  pure  strategies  that  are  assigned  positive 
probability  by  that  mixed  strategy. 
Let us calculate the best response of Player 1 to every possible mixed strategy $\begin{pmatrix} D & E \\ q & 1-q \end{pmatrix}$ of Player 2 (with $q \in [0,1]$). For Player 1 the payoff from playing A against $\begin{pmatrix} D & E \\ q & 1-q \end{pmatrix}$ is $3q$, the payoff from playing B is $3 - 3q$ and the payoff from playing C is constant and equal to 2. These functions are shown in Figure 5.7.


Figure 5.7

Player 1's payoff from each pure strategy against an arbitrary mixed strategy of Player 2.

The red upward-sloping line plots the function $A(q) = 3q$, the blue downward-sloping line plots the function $B(q) = 3 - 3q$ and the horizontal green dotted line the function $C(q) = 2$. The blue and green lines intersect when $q = \frac{1}{3}$ and the red and green lines intersect when $q = \frac{2}{3}$. The maximum payoff is given by the blue line up to $q = \frac{1}{3}$, then by the green line up to $q = \frac{2}{3}$ and then by the red line. Thus the best reply function of Player 1 is as follows:
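$$\text{Player 1's best reply} = \begin{cases} B & \text{if } 0 \le q < \frac{1}{3} \\ \begin{pmatrix} B & C \\ p & 1-p \end{pmatrix} \text{ for any } p \in [0,1] & \text{if } q = \frac{1}{3} \\ C & \text{if } \frac{1}{3} < q < \frac{2}{3} \\ \begin{pmatrix} A & C \\ p & 1-p \end{pmatrix} \text{ for any } p \in [0,1] & \text{if } q = \frac{2}{3} \\ A & \text{if } \frac{2}{3} < q \le 1 \end{cases}$$

Hence if there is a mixed-strategy equilibrium it is either of the form $\left( \begin{pmatrix} A & B & C \\ 0 & p & 1-p \end{pmatrix}, \begin{pmatrix} D & E \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix} \right)$ or of the form $\left( \begin{pmatrix} A & B & C \\ p & 0 & 1-p \end{pmatrix}, \begin{pmatrix} D & E \\ \frac{2}{3} & \frac{1}{3} \end{pmatrix} \right)$. The latter cannot be a Nash equilibrium for any $p$, because when Player 1 plays B with probability 0, E strictly dominates D for Player 2 and thus Player 2's mixed strategy $\begin{pmatrix} D & E \\ \frac{2}{3} & \frac{1}{3} \end{pmatrix}$ is not a best reply (E is the unique best reply). Thus the only candidate for a Nash equilibrium is of the form $\left( \begin{pmatrix} A & B & C \\ 0 & p & 1-p \end{pmatrix}, \begin{pmatrix} D & E \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix} \right)$. In this case, by Theorem 5.2, we need $p$ to be such that Player 2 is indifferent between D and E, that is, we need $2p = 1 - p$, that is, $p = \frac{1}{3}$. Hence the Nash equilibrium is $\left( \begin{pmatrix} A & B & C \\ 0 & \frac{1}{3} & \frac{2}{3} \end{pmatrix}, \begin{pmatrix} D & E \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix} \right)$.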
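A candidate profile can be checked directly against the definition of Nash equilibrium: no player may gain by deviating, and it suffices to test deviations to pure strategies because expected payoffs are linear in a player's own probabilities. The following Python sketch does this for the game of Table 5.6. The helper names are hypothetical, and Player 2's payoffs are taken from the table as reconstructed from the text (they are consistent with the indifference condition $2p = 1 - p$ used above).

```python
from fractions import Fraction as F

# Payoffs of Table 5.6, keyed by (Player 1's strategy, Player 2's strategy).
U1 = {("A", "D"): 3, ("A", "E"): 0, ("B", "D"): 0, ("B", "E"): 3,
      ("C", "D"): 2, ("C", "E"): 2}
U2 = {("A", "D"): 0, ("A", "E"): 2, ("B", "D"): 2, ("B", "E"): 0,
      ("C", "D"): 0, ("C", "E"): 1}

def expected(u, s1, s2):
    """Expected payoff when Player 1 uses mix s1 and Player 2 uses mix s2."""
    return sum(p * q * u[(a, b)] for a, p in s1.items() for b, q in s2.items())

def is_nash(s1, s2):
    """True if neither player has a profitable pure-strategy deviation."""
    v1, v2 = expected(U1, s1, s2), expected(U2, s1, s2)
    ok1 = all(expected(U1, {a: 1}, s2) <= v1 for a in "ABC")
    ok2 = all(expected(U2, s1, {b: 1}) <= v2 for b in "DE")
    return ok1 and ok2

equilibrium = ({"B": F(1, 3), "C": F(2, 3)}, {"D": F(1, 3), "E": F(2, 3)})
not_equilibrium = ({"A": F(1, 2), "B": F(1, 2)}, {"D": F(1, 2), "E": F(1, 2)})
print(is_nash(*equilibrium), is_nash(*not_equilibrium))
```

The second profile satisfies the indifference condition of Theorem 5.2 (A and B both yield 1.5) but fails the check because C yields 2, which illustrates why the condition is necessary but not sufficient.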

In  games  where  the  number  of  strategies  or  the  number  of  players  are  larger 
than  in  the  examples  we  have  considered,  finding  the  Nash  equilibria  involves 
lengthier  calculations.  However,  computer  programs  have  been  developed  that 
can  be  used  to  compute  all  the  Nash  equilibria  of  a finite  game  in  a relatively 
short  time. 


Test  your  understanding  of  the  concepts  introduced  in  this  section,  by  going 
through  the  exercises  in  Section  5.E.3  of  Appendix  5.E  at  the  end  of  this  chapter. 
5.4 Strict dominance and rationalizability

We  remarked  in  the  previous  section  that  a pure  strategy  that  is  strictly 
dominated  by  another  pure  strategy  cannot  be  played  with  positive  probability 
at  a Nash  equilibrium.  Thus,  when  looking  for  a Nash  equilibrium,  one  can 
first  simplify  the  game  by  applying  the  IDSDS  procedure  (Section  1.5.1, 
Chapter  1).  When  payoffs  are  cardinal  (von  Neumann-Morgenstern  payoffs)  it 
turns  out  that,  in  a two-person  game,  a pure  strategy  cannot  be  a best  response  to 
any  mixed-strategy  of  the  opponent  not  only  when  it  is  strictly  dominated  by 
another  pure  strategy  but  also  when  it  is  strictly  dominated  by  a mixed  strategy. 
To  see  this,  consider  the  game  of  Table  5.8  below. 


Player  2 


Table  5.8 

A strategic-form  game  with  cardinal  payoffs. 


The pure strategy B of Player 1 is not strictly dominated by another pure strategy and yet it cannot be a best reply to any mixed strategy of Player 2. To see this, consider an arbitrary mixed strategy $\begin{pmatrix} D & E \\ q & 1-q \end{pmatrix}$ of Player 2 with $q \in [0,1]$. If Player 1 plays B against it, she gets a payoff of 1; if, instead, she plays the mixed strategy $\begin{pmatrix} A & B & C \\ \frac{1}{3} & 0 & \frac{2}{3} \end{pmatrix}$ then her payoff is $\frac{1}{3} 4(1-q) + \frac{2}{3} 2q = \frac{4}{3} > 1$.
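The dominance claim can be verified numerically. The sketch below uses only Player 1's payoffs, as implied by the calculation in the text (A yields 0 against D and 4 against E, B yields 1 against both, C yields 2 against D and 0 against E); the variable and function names are hypothetical. Since the payoff from a mixed strategy is linear in $q$, it is enough to compare payoffs against each pure strategy of Player 2.

```python
from fractions import Fraction as F

# Player 1's payoffs only, keyed by (own strategy, Player 2's strategy).
U1 = {("A", "D"): 0, ("A", "E"): 4, ("B", "D"): 1, ("B", "E"): 1,
      ("C", "D"): 2, ("C", "E"): 0}
mix = {"A": F(1, 3), "C": F(2, 3)}

def mixed_payoff(mix, col):
    """Player 1's expected payoff from `mix` against pure strategy `col`."""
    return sum(p * U1[(s, col)] for s, p in mix.items())

# Strict dominance of B by the mixture: better against every column.
mix_dominates_B = all(mixed_payoff(mix, c) > U1[("B", c)] for c in ("D", "E"))
# No pure strategy does the same job.
pure_dominates_B = any(
    all(U1[(s, c)] > U1[("B", c)] for c in ("D", "E")) for s in ("A", "C")
)
print(mix_dominates_B, pure_dominates_B)
```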


Theorem 5.3 [Pearce, 1984]. Consider a two-player reduced-form game in strategic form with cardinal payoffs, an arbitrary Player $i$ and a pure strategy $s_i$ of Player $i$. Then there is no mixed strategy of the opponent to which $s_i$ is a best response, if and only if $s_i$ is strictly dominated by a mixed strategy $\sigma_i$ of Player $i$ (that is, there is a $\sigma_i \in \Sigma_i$ such that $\Pi_i(\sigma_i, \sigma_j) > \Pi_i(s_i, \sigma_j)$, for every $\sigma_j \in \Sigma_j$).
Note that, since the set of mixed strategies includes the set of pure strategies, strict dominance by a mixed strategy includes as a sub-case strict dominance by a pure strategy.

When the number of players is 3 or more, the generalization of Theorem 5.3 raises some subtle issues: see Exercise 5.14. However, we can appeal to the intuition behind Theorem 5.3 (see Remark 5.5 below) to refine the IDSDS procedure (Section 1.5.1, Chapter 1) for general n-player games with cardinal payoffs as follows.


Definition 5.6 [Cardinal IDSDS]. The Cardinal Iterated Deletion of Strictly Dominated Strategies is the following algorithm. Given a finite n-player ($n \ge 2$) strategic-form game with cardinal payoffs $G$, let $G^1$ be the game obtained by removing from $G$, for every Player i, those pure strategies of Player i (if any) that are strictly dominated in $G$ by some mixed strategy of Player i; let $G^2$ be the game obtained by removing from $G^1$, for every Player i, those pure strategies of Player i (if any) that are strictly dominated in $G^1$ by some mixed strategy of Player i, and so on. Let $G^\infty$ be the output of this procedure. Since the initial game $G$ is finite, $G^\infty$ will be obtained in a finite number of steps. For every Player i, the pure strategies of Player i in $G^\infty$ are called her rationalizable strategies.


Figure  5.9  illustrates  this  procedure  as  applied  to  the  game  in  Panel  (i). 
(i) The game $G = G^0$

                 Player 2
                 D       E       F
           A   3, 4    2, 1    1, 2
Player 1   B   0, 0    1, 3    4, 1
           C   1, 4    1, 4    2, 6

(ii) The game $G^1$ after Step 1

                 Player 2
                 D       E       F
Player 1   A   3, 4    2, 1    1, 2
           B   0, 0    1, 3    4, 1

(iii) The game $G^2$ after Step 2

                 Player 2
                 D       E
Player 1   A   3, 4    2, 1
           B   0, 0    1, 3

(iv) The game $G^3$ at the top. The game $G^4 = G^\infty$ at the bottom

                 Player 2
                 D       E
Player 1   A   3, 4    2, 1

                 Player 2
                 D
Player 1   A   3, 4

Figure 5.9

Application of the cardinal IDSDS procedure.


In the first step, the pure strategy C of Player 1 is deleted, because it is strictly dominated by the mixed strategy $\begin{pmatrix} A & B \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}$, thus yielding game $G^1$ shown in Panel (ii). In the second step, the pure strategy F of Player 2 is deleted, because it is strictly dominated by the mixed strategy $\begin{pmatrix} D & E \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}$, thus yielding game $G^2$ shown in Panel (iii). In the third step, B is deleted because it is strictly dominated by A, thus yielding game $G^3$ shown in the top part of Panel (iv). In the final step, E is deleted because it is strictly dominated by D so that the final output is the strategy profile (A,D). Hence the only rationalizable strategies are A for Player 1 and D for Player 2.
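The steps above can be reproduced with a short program. The following Python sketch is a simplified illustration, not a general implementation of Definition 5.6: it only tests dominance by mixtures of at most two pure strategies (which is enough for the game of Figure 5.9), whereas a general test of dominance by a mixed strategy requires solving a linear program. All function names are hypothetical. For a mixture putting weight $t$ on strategy $a$ and $1-t$ on strategy $b$, each opponent strategy gives one linear inequality in $t$, so the set of suitable weights is an interval that can be computed exactly.

```python
from fractions import Fraction as F
from itertools import combinations

def dominated_by_pair_mix(u, own, other, s):
    """True if s is strictly dominated by t*a + (1-t)*b for some a, b in `own`
    and some t in [0, 1]; pairs (x, x) cover dominance by a pure strategy.
    u[(x, o)] is the player's payoff from own strategy x against opponent's o."""
    rivals = [x for x in own if x != s]
    for a, b in list(combinations(rivals, 2)) + [(x, x) for x in rivals]:
        lo, hi, ok = F(-1), F(2), True   # need lo < t < hi with 0 <= t <= 1
        for o in other:
            # Requirement: t*(u_a - u_b) > u_s - u_b
            slope = F(u[(a, o)] - u[(b, o)])
            rhs = F(u[(s, o)] - u[(b, o)])
            if slope > 0:
                lo = max(lo, rhs / slope)
            elif slope < 0:
                hi = min(hi, rhs / slope)
            elif rhs >= 0:
                ok = False
                break
        if ok and lo < hi and lo < 1 and hi > 0:
            return True
    return False

def cardinal_idsds(rows, cols, u1, u2):
    """Repeat simultaneous deletion for both players until nothing changes."""
    rows, cols = list(rows), list(cols)
    u2_t = {(c, r): v for (r, c), v in u2.items()}
    while True:
        keep_r = [r for r in rows if not dominated_by_pair_mix(u1, rows, cols, r)]
        keep_c = [c for c in cols if not dominated_by_pair_mix(u2_t, cols, rows, c)]
        if keep_r == rows and keep_c == cols:
            return rows, cols
        rows, cols = keep_r, keep_c

# Game of Figure 5.9, Panel (i), keyed by (Player 1's strategy, Player 2's strategy).
U1 = {("A", "D"): 3, ("A", "E"): 2, ("A", "F"): 1,
      ("B", "D"): 0, ("B", "E"): 1, ("B", "F"): 4,
      ("C", "D"): 1, ("C", "E"): 1, ("C", "F"): 2}
U2 = {("A", "D"): 4, ("A", "E"): 1, ("A", "F"): 2,
      ("B", "D"): 0, ("B", "E"): 3, ("B", "F"): 1,
      ("C", "D"): 4, ("C", "E"): 4, ("C", "F"): 6}
print(cardinal_idsds("ABC", "DEF", U1, U2))
```

Note that in the original game F is not dominated for Player 2 (it is his best reply to C); it becomes dominated only after C has been removed, which is why the deletions must be iterated.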
Note that, in the game of Figure 5.9 (i), since the only rationalizable strategy profile is (A,D), it follows that (A,D) is also the unique Nash equilibrium.

As noted in Chapter 1 (Section 1.5.1) the significance of the output of the IDSDS procedure is as follows. Consider game $G$ in Panel (i) of Figure 5.9. Since, for Player 1, C is strictly dominated, if Player 1 is rational she will not play C. Thus, if Player 2 believes that Player 1 is rational then he believes that Player 1 will not play C, that is, he restricts attention to game $G^1$; since, in $G^1$, F is strictly dominated for Player 2, if Player 2 is rational he will not play F. It follows that if Player 1 believes that Player 2 is rational and that Player 2 believes that Player 1 is rational, then Player 1 restricts attention to game $G^2$ where rationality requires that Player 1 not play B, etc.

Remark  5.5.  Define  a player  to  be  rational  if  her  chosen  pure  strategy  is  a best 
reply  to  her  belief  about  what  the  opponent  will  do.  In  a two-player  game  a 
belief  of  Player  1 about  what  Player  2 will  do  can  be  expressed  as  a probability 
distribution  over  the  set  of  pure  strategies  of  Player  2;  but  this  is  the  same 
object  as  a mixed  strategy  of  Player  2.  Thus,  by  Theorem  5.3,  a rational  Player 
1 cannot  choose  a pure  strategy  that  is  strictly  dominated  by  one  of  her  own 
mixed  strategies.  The  iterated  reasoning  outlined  above  can  be  captured  by 
means  of  the  notion  of  common  belief  of  rationality.  Indeed,  it  will  be  shown 
in  a later  chapter  that  if  the  players  are  rational  and  there  is  common  belief  of 
rationality  then  only  rationalizable  strategy  profiles  can  be  played.  In  a game 
with  more  than  two  players  a belief  of  Player  i about  her  opponents  is  no 
longer  the  same  object  as  a mixed-strategy  profile  of  the  opponents,  because  a 
belief  can  allow  for  correlation  in  the  behavior  of  the  opponents,  while  the 
notion  of  mixed-strategy  profile  rules  out  such  correlation  (see  Exercise  5.14). 

Remark  5.6.  The  iterated  reasoning  outlined  above  requires  that  the  von 
Neumann-Morgenstern  preferences  of  both  players  be  common  knowledge  between 
them.  For  example,  if  Player  2 believes  that  Player  1 is  rational  but  only  knows 
her  ordinal  ranking  of  the  outcomes,  then  Player  2 will  not  be  able  to  deduce 
that  it  is  irrational  for  Player  1 to  play  C and  thus  it  cannot  be  irrational  for 
him  to  play  F.  Expecting  a player  to  know  the  von  Neumann-Morgenstern 
preferences  of  another  player  is  often  (almost  always?)  very  unrealistic!  Thus 
one  should  be  aware  of  the  implicit  assumptions  that  one  makes  (and  one 
should  question  the  assumptions  made  by  others  in  their  analyses). 

Test  your  understanding  of  the  concepts  introduced  in  this  section,  by  going 
through  the  exercises  in  Section  5.E.4  of  Appendix  5.E  at  the  end  of  this  chapter. 
Appendix 5.E: Exercises


5.E.1. Exercises for Section 5.1: Strategic-form games with cardinal payoffs

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 


Exercise 5.1. Consider the following game-frame in strategic form, where $o_1$, $o_2$, $o_3$ and $o_4$ are basic outcomes:

                 Player 2
                 c       d
Player 1   a    o_1     o_2
           b    o_3     o_4

Both players satisfy the axioms of expected utility. The best outcome for Player 1 is $o_3$; she is indifferent between outcomes $o_1$ and $o_4$ and ranks them both as worst; she considers $o_2$ to be worse than $o_3$ and better than $o_4$; she is indifferent between $o_2$ with certainty and the lottery $\begin{pmatrix} o_3 & o_1 \\ 0.25 & 0.75 \end{pmatrix}$. The best outcome for Player 2 is $o_4$, which he considers to be just as good as $o_1$; he considers $o_2$ to be worse than $o_1$ and better than $o_3$; he is indifferent between $o_2$ with certainty and the lottery $\begin{pmatrix} o_1 & o_3 \\ 0.4 & 0.6 \end{pmatrix}$.

Find the normalized von Neumann-Morgenstern utility functions for the two players and write the corresponding reduced-form game.




GAME  THEORY  - Giacomo  Bonanno 


Exercise 5.2. Consider the following game-frame, where $o_1, \dots, o_4$ are basic outcomes.

                           Player 2
                  C                         D
Player 1   A   (o_1: 1/4, o_4: 3/4)     (o_1: 1/2, o_2: 1/2)
           B   o_3                       (o_3: 2/5, o_4: 3/5)

Both players have von Neumann-Morgenstern rankings of the basic outcomes. The ranking of Player 1 can be represented by the following von Neumann-Morgenstern utility function

outcome:   o_1   o_2   o_3   o_4
U_1:        12    10     6    16

and the ranking of Player 2 can be represented by the following von Neumann-Morgenstern utility function

outcome:   o_1   o_2   o_3   o_4
U_2:         6    14     8    10

Write the corresponding reduced-form game.
5.E.2. Exercises for Section 5.2: Mixed strategies.

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 


Exercise  5.3.  Consider  the  following  reduced-form  game  with  cardinal  payoffs: 

Player  2 


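(a) Calculate the players' payoffs from the mixed-strategy profile $\left( \begin{pmatrix} A & B & C \\ \frac{1}{4} & \frac{3}{4} & 0 \end{pmatrix}, \begin{pmatrix} D & E \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix} \right)$.

(b) Is $\left( \begin{pmatrix} A & B & C \\ \frac{1}{4} & \frac{3}{4} & 0 \end{pmatrix}, \begin{pmatrix} D & E \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix} \right)$ a Nash equilibrium?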


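Exercise 5.4. Consider the following reduced-form game with cardinal payoffs:

                 Player 2
                 D       E
Player 1   A   2, 3    8, 5
           B   6, 6    4, 2

Prove that $\left( \begin{pmatrix} A & B \\ \frac{2}{3} & \frac{1}{3} \end{pmatrix}, \begin{pmatrix} D & E \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix} \right)$ is a Nash equilibrium.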


5.E.3. Exercises for Section 5.3: Computing the mixed-strategy Nash equilibria.

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 

Exercise 5.5. (a) Find the mixed-strategy Nash equilibrium of the game of Exercise 5.1. (b) Calculate the payoffs of both players at the Nash equilibrium.

Exercise 5.6. Find the Nash equilibrium of the game of Exercise 5.2.

Exercise 5.7. Find all the mixed-strategy Nash equilibria of the game of Exercise 5.4. Calculate the payoffs of both players at every Nash equilibrium that you find.

Exercise  5.8.  Find  the  mixed-strategy  Nash  equilibria  of  the  following  game: 

Player  2 

T 

Player  1 

C 
B 


Exercise 5.9. Consider the following two-player game, where $z_1, z_2, \dots, z_6$ are basic outcomes.


a 

Player  1 b 
c 

Player 1 ranks the outcomes as indicated by A below and Player 2 ranks the outcomes as indicated by B below (if outcome $z$ is above outcome $z'$ then $z$ is strictly preferred to $z'$, and if $z$ and $z'$ are written next to each other then the player is indifferent between the two).


Player  2 


d 
(a) One player has a strategy that is strictly dominated. Identify the player and the strategy.

[Hint:  in  order  to  answer  the  following  questions,  you  can  make  your  life  a lot 
easier  if  you  simplify  the  game  on  the  basis  of  your  answer  to  part  (a).] 


Player  1 satisfies  the  axioms  of  Expected  Utility  Theory  and  is  indifferent 


between  z6  and  the  lottery 


4 

V 5 


and  is  indifferent  between  z2  and  the 


5 y 


lottery 


1 
V 2 


2 y 


(b)  Suppose  that  Player  1 believes  that  Player  2 is  going  to  play  d with 
probability  \ and  e with  probability  \ . Which  strategy  should  he  play? 


Player  2 satisfies  the  axioms  of  Expected  Utility  Theory  and  is  indifferent 


between  z5  and  the  lottery 


l 
V 4 


2 

4 y 


(c)  Suppose  that  Player  2 believes  that  Player  1 is  going  to  play  a with 
probability  \ and  c with  probability  f . Which  strategy  should  she  play? 

(d)  Find  all  the  (pure-  and  mixed-strategy)  Nash  equilibria  of  this  game. 


Exercise  5.10.  Consider  the  following  game  (where  the  payoffs  are  von 
Neumann-Morgenstern  payoffs): 

                 Player 2
                 C       D
Player 1   A   x, y    3, 0
           B   6, 2    0, 4

(a)  Suppose  that  x = 2 and  y = 2.  Find  the  mixed-strategy  Nash  equilibrium 
and  calculate  the  payoffs  of  both  players  at  the  Nash  equilibrium. 


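(b) For what values of $x$ and $y$ is $\left( \begin{pmatrix} A & B \\ \frac{1}{5} & \frac{4}{5} \end{pmatrix}, \begin{pmatrix} C & D \\ \frac{3}{4} & \frac{1}{4} \end{pmatrix} \right)$ a Nash equilibrium?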


Exercise 5.11. Find the mixed-strategy Nash equilibria of the game of Exercise 5.3. Calculate the payoffs of both players at every Nash equilibrium that you find.

5.E.4. Exercises for Section 5.4: Strict dominance and rationalizability

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 

Exercise  5.12.  In  the  following  game,  for  each  player,  find  all  the  rationalizable 
pure  strategies  (that  is,  apply  the  cardinal  IDSDS  procedure). 


                 Player 2
                 L       M       R
           A   3, 5    2, 0    2, 2
Player 1   B   5, 2    1, 2    2, 1
           C   9, 0    1, 5    3, 2

Note:  The  next  three  exercises  are  more  difficult  than  the  previous  ones. 

Exercise  5.13.  Is  the  following  statement  true  or  false?  (Either  prove  that  it  is 
true  or  give  a counterexample.) 

“Consider  a two-player  strategic-form  game  with  cardinal  payoffs.  Let  A 
and  B be  two  pure  strategies  of  Player  1.  Suppose  that  both  A and  B are 
rationalizable  (that  is,  they  survive  the  cardinal  IDSDS  procedure).  Then 
any  mixed  strategy  that  attaches  positive  probability  to  both  A and  B and 
zero  to  every  other  strategy  is  a best  reply  to  some  mixed  strategy  of 
Player  2.” 

Exercise  5.14.  Consider  the  following  three-player  game,  where  only  the 
payoffs  of  Player  1 are  shown. 

Player  2 Player  2 


Player  3:  G Player  3:  H 

(a) Show that if Player 1 assigns probability $\frac{1}{2}$ to the event “Player 2 will play E and Player 3 will play G” and probability $\frac{1}{2}$ to the event “Player 2 will play F and Player 3 will play H”, then playing D is a best reply.

Next we want to show that there is no mixed-strategy profile $\sigma_{-1} = \left( \begin{pmatrix} E & F \\ p & 1-p \end{pmatrix}, \begin{pmatrix} G & H \\ q & 1-q \end{pmatrix} \right)$ of Players 2 and 3 against which D is a best reply for Player 1. We do this in steps. First define the following functions: $A(p,q) = \Pi_1(A, \sigma_{-1})$ (that is, $A(p,q)$ is Player 1's expected payoff if she plays the pure strategy A against $\sigma_{-1}$), $B(p,q) = \Pi_1(B, \sigma_{-1})$, $C(p,q) = \Pi_1(C, \sigma_{-1})$ and $D(p,q) = \Pi_1(D, \sigma_{-1})$.


(b)  In  the  (p,q)  plane  (with  0 < p < 1 and  0 < q < 1)  draw  the  curve 
corresponding  to  the  equation  A(p,q)  = D(p,q)  and  identify  the  region 
where  A(p,q)  > D(p,q) . 

(c)  In  the  (p,q)  plane  draw  the  curve  corresponding  to  the  equation 
C(p,q)  = D(p,q)  and  identify  the  region  where  C(p,q)  > D(p,q) . 

(d) In the (p,q) plane draw the two curves corresponding to the equation B(p,q) = D(p,q) and identify the region where B(p,q) > D(p,q).

(e)  Infer  from  parts  (b)-(c)  that  there  is  no  mixed-strategy  profile  of  Players  2 
and  3 against  which  D is  a best  reply  for  Player  1. 
Exercise 5.15: Challenging Question. A team of n professional swimmers (n > 2) - from now on called players - are partying on the bank of the Sacramento river on a cold day in January. Suddenly a passerby shouts “Help! My dog fell into the water!”. Each of the swimmers has to decide whether or not to jump into the icy cold water to rescue the dog. One rescuer is sufficient: the dog will be saved if at least one player jumps into the water; if nobody does, then the dog will die. Each player prefers somebody else to jump in, but each player prefers to jump in himself if nobody else does. Let us formulate this as a game. The strategy set of each player $i = 1, \dots, n$ is $S_i = \{J, \neg J\}$, where $J$ stands for ‘jump in’ and $\neg J$ for ‘not jump in’. The possible basic outcomes can be expressed as subsets of the set $I = \{1, \dots, n\}$ of players: outcome $N \subseteq I$ is interpreted as ‘the players in the set $N$ jump into the water’; if $N = \emptyset$ the dog dies, while if $N \neq \emptyset$ the dog is saved. Player $i$ has the following ordinal ranking of the outcomes: (1) $N \sim N'$, for every $N \neq \emptyset$, $N' \neq \emptyset$ with $i \notin N$ and $i \notin N'$, (2) $N \succ N'$ for every $N \neq \emptyset$, $N' \neq \emptyset$ with $i \notin N$ and $i \in N'$, (3) $\{i\} \succ \emptyset$.


(a)  Find  all  the  pure-strategy  Nash  equilibria. 

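(b) Suppose that each player $i$ has the following von Neumann-Morgenstern payoff function (which is consistent with the above ordinal ranking):

$$\pi_i(N) = \begin{cases} v & \text{if } N \neq \emptyset \text{ and } i \notin N \\ v - c & \text{if } N \neq \emptyset \text{ and } i \in N \\ 0 & \text{if } N = \emptyset \end{cases} \qquad \text{with } 0 < c < v.$$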


Find  the  symmetric  mixed-strategy  Nash  equilibrium  (symmetric  means 
that  all  the  players  use  the  same  mixed  strategy). 

(c)  Assuming  that  the  players  behave  according  to  the  symmetric  mixed- 
strategy  Nash  equilibrium  of  part  (b),  is  it  better  for  the  dog  if  n (the 
number  of  players)  is  large  or  if  n is  small?  Calculate  the  probability  that 
the  dog  is  saved  at  the  mixed-strategy  Nash  equilibrium  as  a function  of  n, 
for  all  possible  values  of  c and  v (subject  to  0 < c < v),  and  plot  it  for  the  case 
where  c = 10  and  v = 12. 
Appendix 5.S: Solutions to exercises


Exercise 5.1. The normalized von Neumann-Morgenstern utility functions are:

Player 1:   outcome      U_1          Player 2:   outcome      U_2
            o_3          1                        o_1, o_4     1
            o_2          0.25                     o_2          0.4
            o_1, o_4     0                        o_3          0

Thus the reduced-form game is as follows:

                 Player 2
                 c            d
Player 1   a   0, 1       0.25, 0.4
           b   1, 0       0, 1

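Exercise 5.2. The expected utility of the lottery $\begin{pmatrix} o_1 & o_4 \\ \frac{1}{4} & \frac{3}{4} \end{pmatrix}$ is $\frac{1}{4}(12) + \frac{3}{4}(16) = 15$ for Player 1 and $\frac{1}{4}(6) + \frac{3}{4}(10) = 9$ for Player 2. The expected utility of the lottery $\begin{pmatrix} o_1 & o_2 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}$ is 11 for Player 1 and 10 for Player 2. The expected utility of the lottery $\begin{pmatrix} o_3 & o_4 \\ \frac{2}{5} & \frac{3}{5} \end{pmatrix}$ is 12 for Player 1 and 9.2 for Player 2. Thus the reduced-form game is as follows.

                 Player 2
                 C          D
Player 1   A   15, 9      11, 10
           B   6, 8       12, 9.2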
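The reduction from game-frame to reduced-form game is a straightforward expected-utility calculation, which the following Python sketch reproduces. The dictionary layout and the function name are hypothetical; the utilities and lotteries are those of Exercise 5.2.

```python
from fractions import Fraction as F

# von Neumann-Morgenstern utilities of the basic outcomes.
U1 = {"o1": 12, "o2": 10, "o3": 6, "o4": 16}
U2 = {"o1": 6, "o2": 14, "o3": 8, "o4": 10}

# Game-frame: each strategy profile is associated with a lottery over outcomes.
frame = {
    ("A", "C"): {"o1": F(1, 4), "o4": F(3, 4)},
    ("A", "D"): {"o1": F(1, 2), "o2": F(1, 2)},
    ("B", "C"): {"o3": F(1)},
    ("B", "D"): {"o3": F(2, 5), "o4": F(3, 5)},
}

def expected_utility(u, lottery):
    """Probability-weighted average of the utilities of the outcomes."""
    return sum(prob * u[o] for o, prob in lottery.items())

reduced = {cell: (expected_utility(U1, lot), expected_utility(U2, lot))
           for cell, lot in frame.items()}
for cell, payoffs in reduced.items():
    print(cell, payoffs)
```

Exact fractions are used so that the entry for (B, D) comes out as 46/5, that is, 9.2, without rounding error.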

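Exercise 5.3. (a) $\Pi_1 = \frac{1}{8} 0 + \frac{1}{8} 6 + \frac{3}{8} 4 + \frac{3}{8} 2 = 3$ and $\Pi_2 = \frac{1}{8} 1 + \frac{1}{8} 3 + \frac{3}{8} 4 + \frac{3}{8} 0 = 2$.

(b) No, because if Player 1 switched to the pure strategy C then her payoff would be $\frac{1}{2} 3 + \frac{1}{2} 4 = 3.5 > 3$.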
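Exercise 5.4. Player 1's payoff is $\Pi_1 = \frac{2}{6} \cdot 2 + \frac{2}{6} \cdot 8 + \frac{1}{6} \cdot 6 + \frac{1}{6} \cdot 4 = 5$. If Player 1 switches to any other mixed strategy $\begin{pmatrix} A & B \\ p & 1-p \end{pmatrix}$, while Player 2's strategy is kept fixed at $\begin{pmatrix} D & E \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}$, then her payoff is $\Pi_1 = \frac{1}{2} p \cdot 2 + \frac{1}{2} p \cdot 8 + \frac{1}{2}(1-p) \cdot 6 + \frac{1}{2}(1-p) \cdot 4 = 5$. Thus any mixed strategy of Player 1 is a best response to $\begin{pmatrix} D & E \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}$. Similarly, Player 2's payoff is $\Pi_2 = \frac{2}{6} \cdot 3 + \frac{2}{6} \cdot 5 + \frac{1}{6} \cdot 6 + \frac{1}{6} \cdot 2 = 4$. If Player 2 switches to any other mixed strategy $\begin{pmatrix} D & E \\ q & 1-q \end{pmatrix}$, while Player 1's strategy is kept fixed at $\begin{pmatrix} A & B \\ \frac{2}{3} & \frac{1}{3} \end{pmatrix}$, then his payoff is $\Pi_2 = \frac{2}{3} q \cdot 3 + \frac{2}{3}(1-q) \cdot 5 + \frac{1}{3} q \cdot 6 + \frac{1}{3}(1-q) \cdot 2 = 4$. Thus any mixed strategy of Player 2 is a best response to $\begin{pmatrix} A & B \\ \frac{2}{3} & \frac{1}{3} \end{pmatrix}$. Hence $\begin{pmatrix} A & B \\ \frac{2}{3} & \frac{1}{3} \end{pmatrix}$ is a best reply to $\begin{pmatrix} D & E \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}$ and $\begin{pmatrix} D & E \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}$ is a best reply to $\begin{pmatrix} A & B \\ \frac{2}{3} & \frac{1}{3} \end{pmatrix}$, that is, $\left( \begin{pmatrix} A & B \\ \frac{2}{3} & \frac{1}{3} \end{pmatrix}, \begin{pmatrix} D & E \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix} \right)$ is a Nash equilibrium.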


Exercise 5.5. (a) We have to find the Nash equilibrium of the following game.

                 Player 2
                 c            d
Player 1   a   0, 1       0.25, 0.4
           b   1, 0       0, 1

To make calculations easier, let us multiply all the payoffs by 100 (that is, we re-scale the von Neumann-Morgenstern utility functions by a factor of 100):

                 Player 2
                 c            d
Player 1   a   0, 100     25, 40
           b   100, 0     0, 100

There  are  no  pure-strategy  Nash  equilibria.  To  find  the  mixed-strategy  Nash 
equilibrium,  let  p be  the  probability  with  which  Player  1 chooses  a and  q be  the 
probability  with  which  Player  2 chooses  c.  Then,  for  Player  1,  the  payoff  from 


playing  a against 


: d 

[q  i-vj 


must  be  equal  to  the  payoff  from  playing  b against 
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f C d ^ 


U l-qj 


. That  is,  it  must  be  that  25(l-g)  = lOOg , which  yields  q = \- 


Similarly,  for  Player  2,  the  payoff  from  playing  c against 


a b 


must  be 


equal  to  the  payoff  from  playing  d against 


b A 


\P  X~P) 

. This  requires 

,P  l~P) 

100/7  = 40/7  + 100(1-/?),  that  is,  /?  = § . Thus  the  Nash  equilibrium  is 

. (b)  At  the  Nash  equilibrium  the  payoffs  are  20  for  Player  1 

and  62.5  for  Player  2.  (If  you  worked  with  the  original  payoffs,  then  the  Nash 
equilibrium  payoffs  would  be  0.2  for  Player  1 and  0.625  for  Player  2.) 
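The indifference conditions used above can be checked mechanically. The following is a minimal sketch, not part of the original solution: the function names `mixed_equilibrium_2x2` and `expected_payoff` are illustrative, and the code assumes a 2x2 game that has a completely mixed equilibrium (so the denominators are non-zero) and integer payoffs (here the payoffs re-scaled by 100).

```python
from fractions import Fraction as F

def mixed_equilibrium_2x2(u1, u2):
    """Completely mixed equilibrium of a 2x2 game from the indifference conditions.

    u1[i][j] and u2[i][j] are the payoffs of Players 1 and 2 when Player 1
    plays row i and Player 2 plays column j. Returns (p, q), where p is the
    probability of row 0 and q is the probability of column 0.
    """
    # Player 1 indifferent between the two rows pins down q.
    q = F(u1[1][1] - u1[0][1], u1[0][0] - u1[0][1] - u1[1][0] + u1[1][1])
    # Player 2 indifferent between the two columns pins down p.
    p = F(u2[1][1] - u2[1][0], u2[0][0] - u2[1][0] - u2[0][1] + u2[1][1])
    return p, q

def expected_payoff(u, p, q):
    """Expected payoff under the mixed-strategy profile (p, q)."""
    return (p * q * u[0][0] + p * (1 - q) * u[0][1]
            + (1 - p) * q * u[1][0] + (1 - p) * (1 - q) * u[1][1])

# Exercise 5.5 with payoffs multiplied by 100 (rows a, b; columns c, d).
U1 = [[0, 25], [100, 0]]
U2 = [[100, 40], [0, 100]]
p, q = mixed_equilibrium_2x2(U1, U2)
print(p, q)                                   # probability of a, probability of c
print(expected_payoff(U1, p, q), expected_payoff(U2, p, q))
```

Exact fractions are used so that the results can be compared directly with the values derived by hand.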

Exercise 5.6. We have to find the Nash equilibria of the following game.

                  Player 2
                  C              D
Player 1    A     15 , 9         11 , 10
            B     6 , 8          12 , 9.2

For Player 2, D is a strictly dominant strategy; thus at a Nash equilibrium Player 2 must play D with probability 1. For Player 1, the unique best reply to D is B. Thus the pure-strategy profile (B,D) is the only Nash equilibrium.

Exercise 5.7. We have to find the Nash equilibria of the following game.

                  Player 2
                  D              E
Player 1    A     2 , 3          8 , 5
            B     6 , 6          4 , 2

(B,D) (with payoffs (6,6)) and (A,E) (with payoffs (8,5)) are both Nash equilibria. To see if there is also a mixed-strategy equilibrium we need to solve the following equations, where p is the probability of A and q is the probability of D: $2q + 8(1-q) = 6q + 4(1-q)$ and $3p + 6(1-p) = 5p + 2(1-p)$. The solution is $p = \tfrac{2}{3}$ and $q = \tfrac{1}{2}$, and indeed we verified in Exercise 5.4 (where Player 2's strategies were labeled C and D) that $\left( \begin{pmatrix} A & B \\ \frac{2}{3} & \frac{1}{3} \end{pmatrix}, \begin{pmatrix} D & E \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix} \right)$ is a Nash equilibrium. The payoffs at this Nash equilibrium are 5 for Player 1 and 4 for Player 2.
Exercise 5.8. Since B is strictly dominated (by C), it cannot be assigned positive probability at a Nash equilibrium. Let p be the probability of T and q the probability of L. Then p must be such that $4p + 0(1-p) = 3p + 2(1-p)$ and q must be such that $q + 4(1-q) = 2q + (1-q)$. Thus $p = \tfrac{2}{3}$ and $q = \tfrac{3}{4}$. Hence there is only one mixed-strategy equilibrium, namely $\left( \begin{pmatrix} T & C & B \\ \frac{2}{3} & \frac{1}{3} & 0 \end{pmatrix}, \begin{pmatrix} L & R \\ \frac{3}{4} & \frac{1}{4} \end{pmatrix} \right)$.

Exercise 5.9. (a) Since Player 1 prefers $z_5$ to $z_3$ and prefers $z_6$ to $z_4$, strategy b is strictly dominated by strategy c. Thus, at a Nash equilibrium, Player 1 will not play b with positive probability and we can simplify the game to

                  Player 2
                  d          e
Player 1    a     z1         z2
            c     z5         z6

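(b) Of the remaining outcomes, for Player 1 $z_1$ is the best (we can assign utility 1 to it) and $z_5$ is the worst (we can assign utility 0 to it). Since he is indifferent between $z_6$ and the lottery $\begin{pmatrix} z_1 & z_5 \\ \frac{4}{5} & \frac{1}{5} \end{pmatrix}$, the utility of $z_6$ is $\tfrac{4}{5}$. Hence the expected utility of $\begin{pmatrix} z_6 & z_5 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}$ is $\tfrac{1}{2}\left(\tfrac{4}{5}\right) + \tfrac{1}{2}(0) = \tfrac{2}{5}$ and thus the utility of $z_2$ is also $\tfrac{2}{5}$. So, if Player 2 is going to play d with probability $\tfrac{1}{2}$ and e with probability $\tfrac{1}{2}$, then for Player 1 playing a gives a payoff of $\tfrac{1}{2}(1) + \tfrac{1}{2}\left(\tfrac{2}{5}\right) = \tfrac{7}{10}$, while playing c gives a payoff of $\tfrac{1}{2}(0) + \tfrac{1}{2}\left(\tfrac{4}{5}\right) = \tfrac{4}{10}$. Hence he should play a.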


[If you did not take the hint to simplify the analysis as was done above, then you can still reach the same answer, although in a lengthier way. You would still set $U(z_1) = 1$. Then the expected payoff from playing a is

$$\Pi_1(a) = \tfrac{1}{2}U(z_1) + \tfrac{1}{2}U(z_2) = \tfrac{1}{2} + \tfrac{1}{2}U(z_2). \qquad (*)$$

Since $z_2$ is as good as $\begin{pmatrix} z_6 & z_5 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}$,

$$U(z_2) = \tfrac{1}{2}U(z_6) + \tfrac{1}{2}U(z_5). \qquad (**)$$

Since $z_6$ is as good as $\begin{pmatrix} z_1 & z_5 \\ \frac{4}{5} & \frac{1}{5} \end{pmatrix}$,

$$U(z_6) = \tfrac{4}{5} + \tfrac{1}{5}U(z_5). \qquad (***)$$

Replacing (***) in (**) we get $U(z_2) = \tfrac{2}{5} + \tfrac{3}{5}U(z_5)$ and replacing this expression in (*) we get $\Pi_1(a) = \tfrac{7}{10} + \tfrac{3}{10}U(z_5)$. Similarly, $\Pi_1(c) = \tfrac{1}{2}U(z_5) + \tfrac{1}{2}U(z_6) = \tfrac{1}{2}U(z_5) + \tfrac{1}{2}\left(\tfrac{4}{5} + \tfrac{1}{5}U(z_5)\right) = \tfrac{4}{10} + \tfrac{6}{10}U(z_5)$. Now, $\Pi_1(a) > \Pi_1(c)$ if and only if $\tfrac{7}{10} + \tfrac{3}{10}U(z_5) > \tfrac{4}{10} + \tfrac{6}{10}U(z_5)$ if and only if $3 > 3U(z_5)$ if and only if $U(z_5) < 1$, which is the case because $z_5$ is worse than $z_1$ and $U(z_1) = 1$. Similar steps would be taken to answer (c) and (d).]


(c) In the reduced game, for Player 2 $z_2$ is the best outcome (we can assign utility 1 to it) and $z_6$ is the worst (we can assign utility 0 to it). Thus, since she is indifferent between $z_5$ and the lottery $\begin{pmatrix} z_2 & z_6 \\ \frac{1}{4} & \frac{3}{4} \end{pmatrix}$, the utility of $z_5$ is $\tfrac{1}{4}$ and so is the utility of $z_1$. Thus playing d gives an expected payoff of $\tfrac{1}{4}\left(\tfrac{1}{4}\right) + \tfrac{3}{4}\left(\tfrac{1}{4}\right) = \tfrac{1}{4}$, while playing e gives an expected utility of $\tfrac{1}{4}(1) + \tfrac{3}{4}(0) = \tfrac{1}{4}$. Thus she is indifferent between playing d and playing e (and any mixture of d and e).


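(d) Using the calculations of parts (b) and (c) the game is as follows:

              d               e
    a     1 , 1/4         2/5 , 1
    c     0 , 1/4         4/5 , 0

There is no pure-strategy Nash equilibrium. At a mixed-strategy Nash equilibrium, each player must be indifferent between his/her two strategies. From part (c) we already know that Player 2 is indifferent if Player 1 plays a with probability $\tfrac{1}{4}$ and c with probability $\tfrac{3}{4}$. Now let q be the probability with which Player 2 plays d. Then we need $q + \tfrac{2}{5}(1-q) = \tfrac{4}{5}(1-q)$, hence $q = \tfrac{2}{7}$. Thus the Nash equilibrium is $\left( \begin{pmatrix} a & b & c \\ \frac{1}{4} & 0 & \frac{3}{4} \end{pmatrix}, \begin{pmatrix} d & e \\ \frac{2}{7} & \frac{5}{7} \end{pmatrix} \right)$, which can be written more succinctly as $\left( \begin{pmatrix} a & c \\ \frac{1}{4} & \frac{3}{4} \end{pmatrix}, \begin{pmatrix} d & e \\ \frac{2}{7} & \frac{5}{7} \end{pmatrix} \right)$.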
Exercise 5.10. (a) Let p be the probability of A and q the probability of C. Then, Player 1 must be indifferent between playing A and playing B: $2q + 3(1-q) = 6q$. This gives $q = \tfrac{3}{7}$. Similarly, Player 2 must be indifferent between playing C and playing D: $2 = 4(1-p)$. This gives $p = \tfrac{1}{2}$. Thus the Nash equilibrium is given by $\left( \begin{pmatrix} A & B \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}, \begin{pmatrix} C & D \\ \frac{3}{7} & \frac{4}{7} \end{pmatrix} \right)$. The equilibrium payoffs are $\tfrac{18}{7} = 2.57$ for Player 1 and 2 for Player 2.

(b) Player 1 must be indifferent between playing A and playing B: $\tfrac{3}{4}x + \tfrac{1}{4}3 = \tfrac{3}{4}6$. Thus x = 5. Similarly, Player 2 must be indifferent between playing C and playing D: $\tfrac{1}{5}y + \tfrac{4}{5}2 = \tfrac{4}{5}4$. Thus y = 8.


Exercise 5.11. We have to find the Nash equilibria of the following game.

                  Player 2
                  D          E
            A     0 , 1      6 , 3
Player 1    B     4 , 4      2 , 0
            C     3 , 0      4 , 2

There are two pure-strategy equilibria, namely (B,D) and (A,E). To see if there is a mixed-strategy equilibrium we calculate the best response of Player 1 to every possible mixed strategy $\begin{pmatrix} D & E \\ q & 1-q \end{pmatrix}$ of Player 2 (with $q \in [0,1]$). For Player 1 the payoff from playing A against $\begin{pmatrix} D & E \\ q & 1-q \end{pmatrix}$ is $6 - 6q$, the payoff from playing B is $4q + 2(1-q) = 2 + 2q$ and the payoff from playing C is $3q + 4(1-q) = 4 - q$. These functions are shown in the following diagram.
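The red downward-sloping line plots the function $A(q) = 6 - 6q$, the blue upward-sloping line plots the function $B(q) = 2 + 2q$ and the green dotted line the function $C(q) = 4 - q$. It can be seen from the diagram that

$$\text{Player 1's best reply} = \begin{cases} A & \text{if } 0 \le q < \tfrac{2}{5} \\ \begin{pmatrix} A & C \\ p & 1-p \end{pmatrix} \text{ for any } p \in [0,1] & \text{if } q = \tfrac{2}{5} \\ C & \text{if } \tfrac{2}{5} < q < \tfrac{2}{3} \\ \begin{pmatrix} B & C \\ p & 1-p \end{pmatrix} \text{ for any } p \in [0,1] & \text{if } q = \tfrac{2}{3} \\ B & \text{if } \tfrac{2}{3} < q \le 1 \end{cases}$$

Thus if there is a mixed-strategy equilibrium it is either of the form $\left( \begin{pmatrix} A & C \\ p & 1-p \end{pmatrix}, \begin{pmatrix} D & E \\ \frac{2}{5} & \frac{3}{5} \end{pmatrix} \right)$ or of the form $\left( \begin{pmatrix} B & C \\ p & 1-p \end{pmatrix}, \begin{pmatrix} D & E \\ \frac{2}{3} & \frac{1}{3} \end{pmatrix} \right)$. In the first case, where Player 1 chooses B with probability zero, E strictly dominates D for Player 2 and thus $\begin{pmatrix} D & E \\ \frac{2}{5} & \frac{3}{5} \end{pmatrix}$ is not a best reply for Player 2, so that $\left( \begin{pmatrix} A & C \\ p & 1-p \end{pmatrix}, \begin{pmatrix} D & E \\ \frac{2}{5} & \frac{3}{5} \end{pmatrix} \right)$ is not a Nash equilibrium for any p. In the second case we need $D(p) = E(p)$, that is, $4p = 2(1-p)$, which yields $p = \tfrac{1}{3}$. Thus the mixed-strategy Nash equilibrium is $\left( \begin{pmatrix} B & C \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix}, \begin{pmatrix} D & E \\ \frac{2}{3} & \frac{1}{3} \end{pmatrix} \right)$ with payoffs of $\tfrac{10}{3}$ for Player 1 and $\tfrac{4}{3}$ for Player 2.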
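The equilibria found in Exercise 5.11 can be double-checked by testing that every strategy played with positive probability is a best reply. The sketch below is illustrative only: the helper names are hypothetical, and Player 2's payoff of 2 at (C,E) is an assumption inferred from the indifference condition 4p = 2(1-p) used in the solution.

```python
from fractions import Fraction as F

# Payoffs of Exercise 5.11 (rows: Player 1; columns: Player 2).
# Player 2's payoff at (C, E) is inferred from the condition 4p = 2(1 - p).
U1 = {"A": {"D": 0, "E": 6}, "B": {"D": 4, "E": 2}, "C": {"D": 3, "E": 4}}
U2 = {"A": {"D": 1, "E": 3}, "B": {"D": 4, "E": 0}, "C": {"D": 0, "E": 2}}

def row_payoff(row, sigma2):
    """Player 1's expected payoff from a pure row against Player 2's mixture."""
    return sum(prob * U1[row][col] for col, prob in sigma2.items())

def col_payoff(col, sigma1):
    """Player 2's expected payoff from a pure column against Player 1's mixture."""
    return sum(prob * U2[row][col] for row, prob in sigma1.items())

def is_nash(sigma1, sigma2):
    """True if every strategy in the support of each mixture is a best reply."""
    best1 = max(row_payoff(r, sigma2) for r in U1)
    best2 = max(col_payoff(c, sigma1) for c in ("D", "E"))
    ok1 = all(row_payoff(r, sigma2) == best1 for r, pr in sigma1.items() if pr > 0)
    ok2 = all(col_payoff(c, sigma1) == best2 for c, pr in sigma2.items() if pr > 0)
    return ok1 and ok2

mixed = ({"B": F(1, 3), "C": F(2, 3)}, {"D": F(2, 3), "E": F(1, 3)})
print(is_nash(*mixed))
print(row_payoff("B", mixed[1]), col_payoff("D", mixed[0]))  # equilibrium payoffs
```

The same function rejects the candidate of the first form, for example `is_nash({"A": F(1, 2), "C": F(1, 2)}, {"D": F(2, 5), "E": F(3, 5)})`, because D is then not a best reply for Player 2.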


Exercise 5.12. For Player 1, B is strictly dominated by $\begin{pmatrix} A & C \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}$; for Player 2, R is strictly dominated by $\begin{pmatrix} L & M \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}$. Eliminating B and R we are left with

                  Player 2
                  L          M
Player 1    A     3 , 5      2 , 0
            C     9 , 0      1 , 5

In this game no player has a strictly dominated strategy. Thus for Player 1 both A and C are rationalizable and for Player 2 both L and M are rationalizable.


Exercise 5.13. The statement is false. Consider, for example, the following game:

                  Player 2
                  L          R
            A     3 , 1      0 , 0
Player 1    B     0 , 0      3 , 1
            C     2 , 1      2 , 1

Here both A and B are rationalizable (indeed, they are both part of a Nash equilibrium; note that the cardinal IDSDS procedure leaves the game unchanged). However, the mixture $\begin{pmatrix} A & B \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}$ (which gives Player 1 a payoff of 1.5, no matter what Player 2 does) cannot be a best reply to any mixed strategy of Player 2, since it is strictly dominated by C.
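The dominance claim in Exercise 5.13 is easy to verify numerically. The following is a small sketch with hypothetical helper names; it uses the fact that expected payoffs are linear in Player 2's mixed strategy, so it is enough to compare payoffs against each pure strategy of Player 2.

```python
from fractions import Fraction as F

# Player 1's payoffs in the game of Exercise 5.13.
U1 = {"A": {"L": 3, "R": 0}, "B": {"L": 0, "R": 3}, "C": {"L": 2, "R": 2}}

def mixture_payoff(mix, col):
    """Player 1's expected payoff from the mixture `mix` against pure column `col`."""
    return sum(prob * U1[row][col] for row, prob in mix.items())

def strictly_dominated_by(mix, row):
    """True if the pure strategy `row` does strictly better than `mix` against every column."""
    return all(U1[row][col] > mixture_payoff(mix, col) for col in ("L", "R"))

half_half = {"A": F(1, 2), "B": F(1, 2)}
print([mixture_payoff(half_half, col) for col in ("L", "R")])
print(strictly_dominated_by(half_half, "C"))
```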
Exercise 5.14. (a) If Player 1 assigns probability $\tfrac{1}{2}$ to the event “Player 2 will play E and Player 3 will play G” and probability $\tfrac{1}{2}$ to the event “Player 2 will play F and Player 3 will play H”, then A gives Player 1 an expected payoff of 1.5, B an expected payoff of 0, C an expected payoff of 1.5 and D an expected payoff of 2. Thus D is a best reply to those beliefs.

The functions are as follows: $A(p,q) = 3pq$, $B(p,q) = 3(1-p)q + 3p(1-q)$, $C(p,q) = 3(1-p)(1-q)$, $D(p,q) = 2pq + 2(1-p)(1-q)$.

(b) $A(p,q) = D(p,q)$ at those points (p,q) such that $q = \dfrac{2-2p}{2-p}$. The set of such points is the red curve in the diagram below. The region where $A(p,q) > D(p,q)$ is the region above the red curve.

(c) $C(p,q) = D(p,q)$ at those points (p,q) such that $q = \dfrac{1-p}{1+p}$. The set of such points is the blue curve in the diagram below. The region where $C(p,q) > D(p,q)$ is the region below the blue curve.

(d) $B(p,q) = D(p,q)$ at those points (p,q) such that $q = \dfrac{2-5p}{5-10p}$ (for $p \ne \tfrac{1}{2}$). The set of such points is given by the two green curves in the diagram below. The region where $B(p,q) > D(p,q)$ is the region between the two green curves.
Thus in the region strictly above the red curve A is better than D, in the region strictly below the blue curve C is better than D and in the region on and between the red and blue curves B is better than D. Hence, at every point in the (p,q) square there is a pure strategy of Player 1 which is strictly better than D. It follows that there is no mixed-strategy profile $\sigma_{-1}$ against which D is a best reply.
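The conclusion of Exercise 5.14 can be spot-checked on a grid of independent beliefs (p,q), using the four payoff functions given in the solution. This is only a numerical sanity check under an assumed grid resolution, not a proof; the function names are illustrative.

```python
from fractions import Fraction as F

def payoffs(p, q):
    """Player 1's expected payoffs from A, B, C, D against the beliefs (p, q)."""
    return {
        "A": 3 * p * q,
        "B": 3 * (1 - p) * q + 3 * p * (1 - q),
        "C": 3 * (1 - p) * (1 - q),
        "D": 2 * p * q + 2 * (1 - p) * (1 - q),
    }

def d_is_best_reply_on_grid(steps=50):
    """True if, at some grid point, D does at least as well as A, B and C."""
    for i in range(steps + 1):
        for j in range(steps + 1):
            u = payoffs(F(i, steps), F(j, steps))
            if u["D"] >= max(u["A"], u["B"], u["C"]):
                return True
    return False

def correlated_payoffs():
    """Payoffs against the correlated belief of part (a): 1/2 on (p,q)=(1,1), 1/2 on (0,0)."""
    both, neither = payoffs(F(1), F(1)), payoffs(F(0), F(0))
    return {s: F(1, 2) * both[s] + F(1, 2) * neither[s] for s in both}

print(d_is_best_reply_on_grid())   # independent beliefs
print(correlated_payoffs())        # correlated belief of part (a)
```

The contrast between the two outputs illustrates the point of the exercise: D is a best reply to a correlated belief but not to any pair of independent mixed strategies.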


Exercise 5.15. (a) There are n pure-strategy Nash equilibria: at each equilibrium exactly one player jumps in.

(b) Let p be the probability with which each player jumps into the water. Consider a Player i. The probability that none of the other players jump in is $(1-p)^{n-1}$ and thus the probability that somebody else jumps in is $1-(1-p)^{n-1}$. Player i's payoff if he jumps in is $v-c$, while his expected payoff if he does not jump in is $v\left[1-(1-p)^{n-1}\right] + 0\,(1-p)^{n-1} = v\left[1-(1-p)^{n-1}\right]$. Thus we need $v-c = v\left[1-(1-p)^{n-1}\right]$, that is, $p = 1 - \left(\dfrac{c}{v}\right)^{\frac{1}{n-1}}$, which is strictly between 0 and 1 because c < v.

(c) At the Nash equilibrium the probability that nobody jumps in is $(1-p)^{n} = \left(\dfrac{c}{v}\right)^{\frac{n}{n-1}}$; thus this is the probability that the dog dies. Hence the dog is rescued with the remaining probability $1 - \left(\dfrac{c}{v}\right)^{\frac{n}{n-1}}$. This is a decreasing function of n. Thus the larger the number of swimmers who are present, the more likely it is that the dog dies. The plot of this function when c = 10 and v = 12 is shown below.
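The two formulas of Exercise 5.15 can be tabulated directly. The sketch below (function names are illustrative, floating-point arithmetic is assumed to be accurate enough) evaluates the symmetric equilibrium probability of jumping in and the probability that the dog is rescued for c = 10 and v = 12.

```python
def jump_probability(n, c, v):
    """Symmetric mixed-strategy equilibrium probability p = 1 - (c/v)^(1/(n-1))."""
    return 1 - (c / v) ** (1 / (n - 1))

def rescue_probability(n, c, v):
    """Probability that at least one of the n players jumps in: 1 - (c/v)^(n/(n-1))."""
    return 1 - (c / v) ** (n / (n - 1))

c, v = 10, 12
for n in (2, 3, 5, 10, 50):
    p = jump_probability(n, c, v)
    print(n, round(p, 4), round(rescue_probability(n, c, v), 4))
```

As n grows, the rescue probability falls toward 1 - c/v, which is the limit of the expression in part (c).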




GAME  THEORY  - Giacomo  Bonanno 


Chapter 6

Dynamic Games with Cardinal Payoffs

6.1  Behavioral  strategies  in  dynamic  games 

The  definition  of  dynamic  (or  extensive-form)  game  with  cardinal 
payoffs  is  just  like  the  definition  of  extensive  form  game  with  ordinal 
payoffs  (Definition  3.1,  Chapter  3),  the  only  difference  being  that  we 
postulate  von  Neumann-Morgenstern  preferences  instead  of  merely  ordinal 
preferences. 

In Chapter 5 we generalized the notion of strategic-form frame by allowing for lotteries (rather than just simple outcomes) to be associated with strategy profiles. One could do the same for extensive-form frames, as shown in Figure 6.1.

Figure 6.1
An extensive frame with probabilistic outcomes.
The $z_i$'s are terminal nodes and the $o_i$'s are basic outcomes.
In Figure 6.1 $\{z_1, z_2, \dots, z_5\}$ is the set of terminal nodes and $\{o_1, o_2, \dots, o_5\}$ is the set of basic outcomes. Associated with $z_1$ is the lottery $\begin{pmatrix} o_1 & o_2 \\ \frac{2}{3} & \frac{1}{3} \end{pmatrix}$, while the lottery associated with $z_3$ is $\begin{pmatrix} o_1 & o_3 & o_4 \\ \frac{1}{5} & \frac{3}{5} & \frac{1}{5} \end{pmatrix}$, etc.

However,  as  we  saw  at  the  end  of  Chapter  3,  in  extensive  forms  one  can 
explicitly  represent  random  events  by  means  of  chance  moves  (also  called 
moves  of  Nature).  Thus  an  alternative  representation  of  the  extensive-form 
frame  of  Figure  6.1  is  the  extensive  form  shown  in  Figure  6.2. 


Figure 6.2
An alternative representation of the extensive frame of Figure 6.1.
The terminal nodes have not been labeled. The $o_i$'s are basic outcomes.


Hence  we  can  continue  to  use  the  definition  of  extensive-form  frame  given  in 
Chapter  3,  but  from  now  on  we  will  allow  for  the  possibility  of  chance  moves. 

The notion of strategy remains, of course, unchanged: a strategy for a player is a list of choices, one for every information set of that player (Definition 3.4, Chapter 3). For example, the set of strategies for Player 1 in the extensive frame of Figure 6.2 is $S_1 = \{(a,e),(a,f),(b,e),(b,f)\}$. Thus mixed strategies can easily be introduced also in extensive frames. For example, in the extensive frame of Figure 6.2, the set of mixed strategies for Player 1, $\Sigma_1$, is the set of probability distributions over $S_1$:

$$\Sigma_1 = \left\{ \begin{pmatrix} (a,e) & (a,f) & (b,e) & (b,f) \\ p & q & r & 1-p-q-r \end{pmatrix} : p,q,r \in [0,1] \text{ and } p+q+r \le 1 \right\}.$$

However,  it  turns  out  that  in  extensive  forms  with  perfect  recall  one  can  use 
simpler  objects  than  mixed  strategies,  namely  behavioral  strategies. 

Definition  6.1.  A behavioral  strategy  for  a player  in  an  extensive  form  is  a list 
of  probability  distributions,  one  for  every  information  set  of  that  player;  each 
probability  distribution  is  over  the  set  of  choices  at  the  corresponding 
information  set. 

For example, the set of behavioral strategies for Player 1 in the extensive frame of Figure 6.2 is

$$\left\{ \left( \begin{array}{cc|cc} a & b & e & f \\ p & 1-p & q & 1-q \end{array} \right) : p,q \in [0,1] \right\}.$$

A behavioral strategy is a simpler object than a mixed strategy: in this example, specifying a behavioral strategy for Player 1 requires specifying the values of two parameters (p and q), while specifying a mixed strategy requires specifying the values of three parameters (p, q and r). Can one then use behavioral strategies rather than mixed strategies? The answer is affirmative, as Theorem 6.1 below states. First we illustrate with an example based on the extensive form of Figure 6.3 (which is a simplified version of the extensive form of Figure 6.2, obtained by removing the moves of Nature; the $z_i$'s are terminal nodes and the outcomes have been omitted).

Figure 6.3
An extensive frame with the outcomes omitted. The $z_i$'s are terminal nodes.


The  definition  of  perfect  recall  was  included  as  part  of  Definition  3.1  (Chapter  3). 
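Consider the mixed-strategy profile $\sigma = (\sigma_1, \sigma_2)$ with

$$\sigma_1 = \begin{pmatrix} (a,e) & (a,f) & (b,e) & (b,f) \\ \frac{1}{12} & \frac{4}{12} & \frac{2}{12} & \frac{5}{12} \end{pmatrix} \quad \text{and} \quad \sigma_2 = \begin{pmatrix} c & d \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix}.$$

We can compute the probability of reaching every terminal node as follows:

$$\begin{aligned} P(z_1) &= \sigma_1((a,e))\,\sigma_2(c) + \sigma_1((a,f))\,\sigma_2(c) = \tfrac{1}{12}\left(\tfrac{1}{3}\right) + \tfrac{4}{12}\left(\tfrac{1}{3}\right) = \tfrac{5}{36}, \\ P(z_2) &= \sigma_1((a,e))\,\sigma_2(d) + \sigma_1((a,f))\,\sigma_2(d) = \tfrac{1}{12}\left(\tfrac{2}{3}\right) + \tfrac{4}{12}\left(\tfrac{2}{3}\right) = \tfrac{10}{36}, \\ P(z_3) &= \sigma_1((b,e))\,\sigma_2(c) + \sigma_1((b,f))\,\sigma_2(c) = \tfrac{2}{12}\left(\tfrac{1}{3}\right) + \tfrac{5}{12}\left(\tfrac{1}{3}\right) = \tfrac{7}{36}, \\ P(z_4) &= \sigma_1((b,e))\,\sigma_2(d) = \tfrac{2}{12}\left(\tfrac{2}{3}\right) = \tfrac{4}{36}, \\ P(z_5) &= \sigma_1((b,f))\,\sigma_2(d) = \tfrac{5}{12}\left(\tfrac{2}{3}\right) = \tfrac{10}{36}. \end{aligned}$$

That is, the mixed-strategy profile $\sigma = (\sigma_1, \sigma_2)$ gives rise to the following probability distribution over terminal nodes:

$$\begin{pmatrix} z_1 & z_2 & z_3 & z_4 & z_5 \\ \frac{5}{36} & \frac{10}{36} & \frac{7}{36} & \frac{4}{36} & \frac{10}{36} \end{pmatrix}.$$

Now consider the following behavioral strategy of Player 1: $\left( \begin{array}{cc|cc} a & b & e & f \\ \frac{5}{12} & \frac{7}{12} & \frac{2}{7} & \frac{5}{7} \end{array} \right)$. What probability distribution over the set of terminal nodes would it determine in conjunction with Player 2's mixed strategy $\sigma_2 = \begin{pmatrix} c & d \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix}$? The calculations are simple (P(x) denotes the probability of choice x for Player 1, according to the given behavioral strategy):

$$\begin{aligned} P(z_1) &= P(a)\,\sigma_2(c) = \tfrac{5}{12}\left(\tfrac{1}{3}\right) = \tfrac{5}{36}, \qquad P(z_2) = P(a)\,\sigma_2(d) = \tfrac{5}{12}\left(\tfrac{2}{3}\right) = \tfrac{10}{36}, \\ P(z_3) &= P(b)\,\sigma_2(c) = \tfrac{7}{12}\left(\tfrac{1}{3}\right) = \tfrac{7}{36}, \qquad P(z_4) = P(b)\,\sigma_2(d)\,P(e) = \tfrac{7}{12}\left(\tfrac{2}{3}\right)\left(\tfrac{2}{7}\right) = \tfrac{4}{36}, \\ P(z_5) &= P(b)\,\sigma_2(d)\,P(f) = \tfrac{7}{12}\left(\tfrac{2}{3}\right)\left(\tfrac{5}{7}\right) = \tfrac{10}{36}. \end{aligned}$$

Thus, against $\sigma_2 = \begin{pmatrix} c & d \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix}$, Player 1's behavioral strategy $\left( \begin{array}{cc|cc} a & b & e & f \\ \frac{5}{12} & \frac{7}{12} & \frac{2}{7} & \frac{5}{7} \end{array} \right)$ and her mixed strategy $\begin{pmatrix} (a,e) & (a,f) & (b,e) & (b,f) \\ \frac{1}{12} & \frac{4}{12} & \frac{2}{12} & \frac{5}{12} \end{pmatrix}$ are equivalent, in the sense that they give rise to the same probability distribution $\begin{pmatrix} z_1 & z_2 & z_3 & z_4 & z_5 \\ \frac{5}{36} & \frac{10}{36} & \frac{7}{36} & \frac{4}{36} & \frac{10}{36} \end{pmatrix}$ over terminal nodes.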
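The passage from a mixed strategy to an equivalent behavioral strategy in this example can be sketched in code. The tree structure encoded in `outcome` below is an assumption read off the probability calculations above (a followed by c or d leads to $z_1$ or $z_2$; b followed by c leads to $z_3$; b followed by d leads to Player 1's second move, with e giving $z_4$ and f giving $z_5$). The function names are illustrative, and `to_behavioral` assumes that b is chosen with positive probability.

```python
from fractions import Fraction as F

sigma1 = {("a", "e"): F(1, 12), ("a", "f"): F(4, 12),
          ("b", "e"): F(2, 12), ("b", "f"): F(5, 12)}
sigma2 = {"c": F(1, 3), "d": F(2, 3)}

def outcome(first, second, move2):
    """Terminal node reached (tree structure assumed from the text's calculations)."""
    if first == "a":
        return "z1" if move2 == "c" else "z2"
    if move2 == "c":
        return "z3"
    return "z4" if second == "e" else "z5"

def terminal_dist_mixed(sigma1, sigma2):
    """Distribution over terminal nodes induced by a mixed-strategy profile."""
    dist = {z: F(0) for z in ("z1", "z2", "z3", "z4", "z5")}
    for (first, second), pr1 in sigma1.items():
        for move2, pr2 in sigma2.items():
            dist[outcome(first, second, move2)] += pr1 * pr2
    return dist

def to_behavioral(sigma1):
    """Behavioral strategy: marginal at the first information set, conditional at the second."""
    p_a = sum(pr for (first, _), pr in sigma1.items() if first == "a")
    p_b = 1 - p_a
    # The second information set is reached only after b, so condition on b.
    p_e = sigma1[("b", "e")] / p_b
    return {"a": p_a, "b": p_b, "e": p_e, "f": 1 - p_e}

def terminal_dist_behavioral(b1, sigma2):
    """Distribution over terminal nodes induced by a behavioral strategy of Player 1."""
    return {"z1": b1["a"] * sigma2["c"], "z2": b1["a"] * sigma2["d"],
            "z3": b1["b"] * sigma2["c"],
            "z4": b1["b"] * sigma2["d"] * b1["e"],
            "z5": b1["b"] * sigma2["d"] * b1["f"]}

b1 = to_behavioral(sigma1)
print(b1)
print(terminal_dist_mixed(sigma1, sigma2) == terminal_dist_behavioral(b1, sigma2))
```

The conditioning step in `to_behavioral` is where perfect recall matters: at her second information set Player 1 knows that she chose b earlier, so the relevant probabilities are those conditional on b.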




Theorem  6.1  [Kuhn,  1953].  In  extensive  forms  with  perfect  recall,  behavioral 
strategies  and  mixed  strategies  are  equivalent,  in  the  sense  that,  for  every  mixed 
strategy  there  is  a behavioral  strategy  that  gives  rise  to  the  same  probability 
distribution  over  terminal  nodes.10 


10 A more precise statement is as follows. Consider an extensive form with perfect recall and a Player i. Let $x_{-i}$ be an arbitrary profile of strategies of the players other than i, where, for every $j \ne i$, $x_j$ is either a mixed or a behavioral strategy of Player j. Then, for every mixed strategy $\sigma_i$ of Player i there is a behavioral strategy $b_i$ of Player i such that $(\sigma_i, x_{-i})$ and $(b_i, x_{-i})$ give rise to the same probability distribution over the set of terminal nodes.

Without perfect recall, Theorem 6.1 does not hold. To see this, consider the one-player extensive form shown in Figure 6.4 and the mixed strategy $\begin{pmatrix} (a,c) & (a,d) & (b,c) & (b,d) \\ \frac{1}{2} & 0 & 0 & \frac{1}{2} \end{pmatrix}$, which induces the probability distribution $\begin{pmatrix} z_1 & z_2 & z_3 & z_4 \\ \frac{1}{2} & 0 & 0 & \frac{1}{2} \end{pmatrix}$ on the set of terminal nodes. Consider an arbitrary behavioral strategy $\left( \begin{array}{cc|cc} a & b & c & d \\ p & 1-p & q & 1-q \end{array} \right)$, whose corresponding probability distribution over the set of terminal nodes is $\begin{pmatrix} z_1 & z_2 & z_3 & z_4 \\ pq & p(1-q) & (1-p)q & (1-p)(1-q) \end{pmatrix}$. In order to have $P(z_2) = 0$ it must be that either p = 0 or q = 1. If p = 0 then $P(z_1) = 0$ and if q = 1 then $P(z_4) = 0$. Thus the probability distribution $\begin{pmatrix} z_1 & z_2 & z_3 & z_4 \\ \frac{1}{2} & 0 & 0 & \frac{1}{2} \end{pmatrix}$ cannot be achieved with a behavioral strategy.
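The impossibility argument can also be seen through a simple identity, which is an observation added here rather than part of the original text: for every behavioral strategy (p, q) the induced distribution satisfies $P(z_1)P(z_4) = P(z_2)P(z_3)$, while the target distribution has $P(z_1)P(z_4) = \tfrac{1}{4}$ and $P(z_2)P(z_3) = 0$. A short sketch, with illustrative names and an arbitrary grid resolution:

```python
from fractions import Fraction as F

def behavioral_distribution(p, q):
    """Distribution over (z1, z2, z3, z4) when a has probability p and c has probability q."""
    return (p * q, p * (1 - q), (1 - p) * q, (1 - p) * (1 - q))

TARGET = (F(1, 2), F(0), F(0), F(1, 2))

def satisfies_product_identity(dist):
    """P(z1) * P(z4) == P(z2) * P(z3), which holds for every behavioral strategy."""
    return dist[0] * dist[3] == dist[1] * dist[2]

def target_reached_on_grid(steps=20):
    """Search a grid of (p, q) values for a behavioral strategy inducing TARGET."""
    grid = [F(i, steps) for i in range(steps + 1)]
    return any(behavioral_distribution(p, q) == TARGET for p in grid for q in grid)

print(satisfies_product_identity(behavioral_distribution(F(1, 3), F(3, 4))))
print(satisfies_product_identity(TARGET))
print(target_reached_on_grid())
```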


Since the focus of this book is on extensive-form games with perfect recall, by appealing to Theorem 6.1 from now on we can restrict attention to behavioral strategies.


As usual, one goes from a frame to a game by adding preferences over outcomes. Let O be the set of basic outcomes (recall that with every terminal node is associated a basic outcome) and $\mathcal{L}(O)$ the set of lotteries (probability distributions) over O.


Definition 6.2. An extensive-form game with cardinal payoffs is an extensive frame (with, possibly, chance moves) together with a von Neumann-Morgenstern ranking $\succsim_i$ of the set of lotteries $\mathcal{L}(O)$, for every Player i.


As  usual,  it  is  convenient  to  represent  a von  Neumann-Morgenstern  ranking  by 
means  of  a von  Neumann-Morgenstern  utility  function  and  replace  the 
outcomes  with  a vector  of  utilities,  one  for  every  player. 


For example, consider the extensive form of Figure 6.2 where the set of basic outcomes is $O = \{o_1, o_2, o_3, o_4, o_5\}$ and suppose that Player 1 has a von Neumann-Morgenstern ranking of $\mathcal{L}(O)$ that is represented by the following von Neumann-Morgenstern utility function:

    outcome:   o1   o2   o3   o4   o5
    U1:         5    2    0    1    3

Suppose also that Player 2 has preferences represented by the von Neumann-Morgenstern utility function

    outcome:   o1   o2   o3   o4   o5
    U2:         3    6    4    5    0

Then from the extensive frame of Figure 6.2 we obtain the extensive-form game with cardinal payoffs shown in Figure 6.5. Since the expected utility of lottery $\begin{pmatrix} o_1 & o_2 \\ \frac{2}{3} & \frac{1}{3} \end{pmatrix}$ is 4 for both players, and the expected utility of lottery $\begin{pmatrix} o_1 & o_3 & o_4 \\ \frac{1}{5} & \frac{3}{5} & \frac{1}{5} \end{pmatrix}$ is 1.2 for Player 1 and 4 for Player 2, we can simplify the game by replacing the first move of Nature with the payoff vector (4,4) and the second move of Nature with the payoff vector (1.2, 4). The simplified game is shown in Figure 6.6.
Figure 6.5
An extensive game based on the frame of Figure 6.2.
The terminal nodes have not been labeled. The $o_i$'s are basic outcomes.
Payoffs at the terminal nodes (top row: Player 1; bottom row: Player 2):
    5  2  0  5  0  1  1  3
    3  6  4  3  4  5  5  0

Figure 6.6
A simplified version of the game of Figure 6.5. The $z_i$'s are terminal nodes.
Note that this is a game based on the frame of Figure 6.3.
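Given an extensive game with cardinal payoffs, associated with every behavioral strategy profile is a lottery over basic outcomes and thus, using a von Neumann-Morgenstern utility function for each player, a payoff for each player. For example, the behavioral strategy profile

$$\left( \left( \begin{array}{cc|cc} a & b & e & f \\ \frac{5}{12} & \frac{7}{12} & \frac{2}{7} & \frac{5}{7} \end{array} \right), \begin{pmatrix} c & d \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix} \right)$$

for the extensive game of Figure 6.5 gives rise to the lottery

$$\begin{pmatrix} o_1 & o_2 & o_3 & o_4 & o_5 \\ \frac{71}{540} & \frac{25}{540} & \frac{213}{540} & \frac{81}{540} & \frac{150}{540} \end{pmatrix}$$

(for instance, the probability of basic outcome $o_1$ is calculated as follows: $P(o_1) = P(a)P(c)\tfrac{2}{3} + P(b)P(c)\tfrac{1}{5} = \tfrac{5}{12}\cdot\tfrac{1}{3}\cdot\tfrac{2}{3} + \tfrac{7}{12}\cdot\tfrac{1}{3}\cdot\tfrac{1}{5} = \tfrac{71}{540}$). Using the utility function postulated above for Player 1, namely

    outcome:   o1   o2   o3   o4   o5
    U1:         5    2    0    1    3

we get a corresponding payoff of $\tfrac{71}{540}5 + \tfrac{25}{540}2 + \tfrac{213}{540}0 + \tfrac{81}{540}1 + \tfrac{150}{540}3 = \tfrac{936}{540} = 1.733$. An alternative way of computing this payoff is by using the simplified game of Figure 6.6, where the behavioral strategy profile $\left( \left( \begin{array}{cc|cc} a & b & e & f \\ \frac{5}{12} & \frac{7}{12} & \frac{2}{7} & \frac{5}{7} \end{array} \right), \begin{pmatrix} c & d \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix} \right)$ yields the probability distribution over terminal nodes $\begin{pmatrix} z_1 & z_2 & z_3 & z_4 & z_5 \\ \frac{5}{36} & \frac{10}{36} & \frac{7}{36} & \frac{4}{36} & \frac{10}{36} \end{pmatrix}$, which, in turn, yields the probability distribution $\begin{pmatrix} 4 & 0 & 1.2 & 1 & 3 \\ \frac{5}{36} & \frac{10}{36} & \frac{7}{36} & \frac{4}{36} & \frac{10}{36} \end{pmatrix}$ over utilities for Player 1. From the latter we get that the expected payoff for Player 1 is $\tfrac{5}{36}4 + \tfrac{10}{36}0 + \tfrac{7}{36}1.2 + \tfrac{4}{36}1 + \tfrac{10}{36}3 = \tfrac{936}{540} = 1.733$. The calculations for Player 2 are similar (see Exercise 6.3).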
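Both routes to Player 1's payoff can be reproduced with exact arithmetic. In the sketch below the names are illustrative, and the assignment of the sure outcomes $o_3$, $o_4$, $o_5$ to the terminal nodes $z_2$, $z_4$, $z_5$ is an assumption inferred from the payoffs shown in Figure 6.5 and from the utilities (4, 0, 1.2, 1, 3) listed above.

```python
from fractions import Fraction as F

# Probability of each terminal node under the behavioral strategy profile in the text.
node_prob = {"z1": F(5, 36), "z2": F(10, 36), "z3": F(7, 36),
             "z4": F(4, 36), "z5": F(10, 36)}

# Lottery over basic outcomes attached to each terminal node
# (z2, z4, z5 are assumed to yield o3, o4, o5 for sure).
lottery_at = {
    "z1": {"o1": F(2, 3), "o2": F(1, 3)},
    "z2": {"o3": F(1)},
    "z3": {"o1": F(1, 5), "o3": F(3, 5), "o4": F(1, 5)},
    "z4": {"o4": F(1)},
    "z5": {"o5": F(1)},
}

U1 = {"o1": 5, "o2": 2, "o3": 0, "o4": 1, "o5": 3}

def outcome_lottery(node_prob, lottery_at):
    """Compound the node probabilities with the lotteries attached to the nodes."""
    lottery = {}
    for z, pz in node_prob.items():
        for o, po in lottery_at[z].items():
            lottery[o] = lottery.get(o, F(0)) + pz * po
    return lottery

def expected_utility(lottery, utility):
    """Expected utility of a lottery over basic outcomes."""
    return sum(prob * utility[o] for o, prob in lottery.items())

# Route 1: lottery over basic outcomes, then expected utility.
direct = expected_utility(outcome_lottery(node_prob, lottery_at), U1)
# Route 2: replace each node's lottery by its expected utility (Figure 6.6), then average.
via_nodes = sum(pz * expected_utility(lottery_at[z], U1) for z, pz in node_prob.items())
print(direct, via_nodes, float(direct))
```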

Test  your  understanding  of  the  concepts  introduced  in  this  section,  by  going 
through  the  exercises  in  Section  6.E.1  of  Appendix  6.E  at  the  end  of  this  chapter. 


6.2  Subgame-perfect  equilibrium  revisited 

The  notion  of  subgame-perfect  equilibrium  was  introduced  in  Chapter  3 
(Definition  3.5)  for  extensive-form  games  with  ordinal  payoffs.  When  payoffs 
are  ordinal,  a subgame-perfect  equilibrium  might  fail  to  exist  because  either  the 
entire  game  or  a proper  subgame  does  not  have  any  Nash  equilibria.  In  the  case 
of  finite  extensive-form  games  with  cardinal  payoffs,  a subgame-perfect 
equilibrium  always  exists,  because  - by  Nash’s  theorem  (Theorem  5.1,  Chapter 
5)  - every  finite  game  has  at  least  one  Nash  equilibrium  in  mixed  strategies. 
Thus, in the case of cardinal payoffs, the subgame-perfect equilibrium algorithm (Definition 3.6, Chapter 3) never halts prematurely for lack of a Nash equilibrium, and the output of the algorithm is a subgame-perfect equilibrium.

We  shall  illustrate  this  with  the  extensive-form  game  shown  in  Figure  6.7 
below. 

Figure 6.7
An extensive-form game with cardinal payoffs.

Let  us  apply  the  subgame-perfect  equilibrium  algorithm  to  this  game.  We  start 
with  the  proper  subgame  that  begins  at  Player  2’s  decision  node  on  the  left, 
whose  strategic  form  is  shown  in  Table  6.8.  Note  that  this  subgame  has  no 
pure-strategy  Nash  equilibria.  Thus  if  payoffs  were  merely  ordinal  payoffs  the 
algorithm  would  halt  and  we  would  conclude  that  the  game  of  Figure  6.7  has 
no  subgame-perfect  equilibria.  However,  we  will  assume  that  payoffs  are 
cardinal  (that  is,  that  they  are  von  Neumann-Morgenstern  utilities). 


                  Player 3
                  C          D
Player 2    A     3 , 1      0 , 2
            B     0 , 3      1 , 2

Table 6.8
The strategic form of the proper subgame on the left in the game of Figure 6.7.

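To find the mixed-strategy Nash equilibrium of the game of Table 6.8, let p be the probability of A and q the probability of C. Then we need q to be such that $3q = 1-q$, that is, $q = \tfrac{1}{4}$, and p to be such that $p + 3(1-p) = 2$, that is, $p = \tfrac{1}{2}$. Thus the Nash equilibrium of this proper subgame is $\left( \begin{pmatrix} A & B \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}, \begin{pmatrix} C & D \\ \frac{1}{4} & \frac{3}{4} \end{pmatrix} \right)$, yielding the following payoffs: $\tfrac{1}{2}\tfrac{1}{4}1 + \tfrac{1}{2}\tfrac{3}{4}2 + \tfrac{1}{2}\tfrac{1}{4}2 + \tfrac{1}{2}\tfrac{3}{4}0 = 1.125$ for Player 1, $\tfrac{1}{2}\tfrac{1}{4}3 + \tfrac{1}{2}\tfrac{3}{4}0 + \tfrac{1}{2}\tfrac{1}{4}0 + \tfrac{1}{2}\tfrac{3}{4}1 = 0.75$ for Player 2 and $\tfrac{1}{2}\tfrac{1}{4}1 + \tfrac{1}{2}\tfrac{3}{4}2 + \tfrac{1}{2}\tfrac{1}{4}3 + \tfrac{1}{2}\tfrac{3}{4}2 = 2$ for Player 3. Thus we can simplify the game of Figure 6.7 as follows: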

Figure 6.9
The game of Figure 6.7 after replacing the proper subgame on the left with the payoffs associated with its Nash equilibrium.


Now  consider  the  proper  subgame  of  the  game  of  Figure  6.9  (the  subgame  that 
starts  at  Player  2’s  node).  Its  strategic  form  is  shown  in  Table  6.10. 


                  Player 3
                  G          H
Player 2    E     0 , 3      1 , 2
            F     2 , 1      0 , 3

Table 6.10
The strategic form of the proper subgame of the game of Figure 6.9.


11 Note that, by Theorem 5.2 (Chapter 5) the Nash equilibrium payoff of Player 2 can be computed much more quickly as the expected payoff from the pure strategy A, namely $\tfrac{3}{4}$, and the payoff of Player 3 as the expected payoff from the pure strategy D, namely 2.
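Again, there is no pure-strategy Nash equilibrium. To find the mixed-strategy equilibrium let p be the probability of E and q the probability of G. Then we need q to be such that $1-q = 2q$, that is, $q = \tfrac{1}{3}$, and p to be such that $3p + 1 - p = 2p + 3(1-p)$, that is, $p = \tfrac{2}{3}$. Hence the Nash equilibrium is $\left( \begin{pmatrix} E & F \\ \frac{2}{3} & \frac{1}{3} \end{pmatrix}, \begin{pmatrix} G & H \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix} \right)$, yielding the following payoffs: $\tfrac{2}{3}\tfrac{1}{3}2 + \tfrac{2}{3}\tfrac{2}{3}0 + \tfrac{1}{3}\tfrac{1}{3}1 + \tfrac{1}{3}\tfrac{2}{3}2 = 1$ for Player 1, $\tfrac{2}{3}\tfrac{1}{3}0 + \tfrac{2}{3}\tfrac{2}{3}1 + \tfrac{1}{3}\tfrac{1}{3}2 + \tfrac{1}{3}\tfrac{2}{3}0 = 0.67$ for Player 2 and $\tfrac{2}{3}\tfrac{1}{3}3 + \tfrac{2}{3}\tfrac{2}{3}2 + \tfrac{1}{3}\tfrac{1}{3}1 + \tfrac{1}{3}\tfrac{2}{3}3 = 2.33$ for Player 3.

Thus we can simplify the game of Figure 6.9 as shown in Figure 6.11 below, where the optimal choice for Player 1 is L.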

Figure 6.11
The game of Figure 6.9 after replacing the proper subgame with the payoffs associated with the Nash equilibrium.
Player 1's choice L leads to the payoff vector (1.125, 0.75, 2) and R leads to the payoff vector (1, 0.67, 2.33).

Hence the subgame-perfect equilibrium of the game of Figure 6.7 (expressed in terms of behavioral strategies) is:

$$\left( \begin{pmatrix} L & R \\ 1 & 0 \end{pmatrix}, \left( \begin{array}{cc|cc} A & B & E & F \\ \frac{1}{2} & \frac{1}{2} & \frac{2}{3} & \frac{1}{3} \end{array} \right), \left( \begin{array}{cc|cc} C & D & G & H \\ \frac{1}{4} & \frac{3}{4} & \frac{1}{3} & \frac{2}{3} \end{array} \right) \right)$$
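The steps of the algorithm for this example can be sketched as follows. This is a minimal illustration, not a general implementation: it assumes each proper subgame is a 2x2 game with a completely mixed equilibrium, the function names are hypothetical, and Player 1's payoffs in the two subgames are the values that appear in the payoff calculations above.

```python
from fractions import Fraction as F

def solve_2x2(u_row, u_col):
    """Completely mixed equilibrium (p, q) of a 2x2 game from the indifference conditions."""
    q = F(u_row[1][1] - u_row[0][1], u_row[0][0] - u_row[0][1] - u_row[1][0] + u_row[1][1])
    p = F(u_col[1][1] - u_col[1][0], u_col[0][0] - u_col[1][0] - u_col[0][1] + u_col[1][1])
    return p, q

def expected(u, p, q):
    """Expected payoff when the row player uses p and the column player uses q."""
    return (p * q * u[0][0] + p * (1 - q) * u[0][1]
            + (1 - p) * q * u[1][0] + (1 - p) * (1 - q) * u[1][1])

# Left subgame (Table 6.8): rows A, B for Player 2; columns C, D for Player 3.
left = {"u1": [[1, 2], [2, 0]], "u2": [[3, 0], [0, 1]], "u3": [[1, 2], [3, 2]]}
# Right subgame (Table 6.10): rows E, F for Player 2; columns G, H for Player 3.
right = {"u1": [[2, 0], [1, 2]], "u2": [[0, 1], [2, 0]], "u3": [[3, 2], [1, 3]]}

def solve_subgame(game):
    """Equilibrium of a subgame and the payoff vector that replaces it in the tree."""
    p, q = solve_2x2(game["u2"], game["u3"])
    return (p, q), tuple(expected(game[k], p, q) for k in ("u1", "u2", "u3"))

left_eq, left_payoffs = solve_subgame(left)
right_eq, right_payoffs = solve_subgame(right)
# Player 1 compares the payoff vectors that replaced the two subgames.
choice = "L" if left_payoffs[0] > right_payoffs[0] else "R"
print(left_eq, left_payoffs)
print(right_eq, right_payoffs)
print(choice)
```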

We  conclude  this  section  with  the  following  theorem,  which  is  a corollary  of 
Theorem  5.1  (Chapter  5). 

Theorem 6.2. Every finite extensive-form game with cardinal payoffs has at least one subgame-perfect equilibrium.

Test  your  understanding  of  the  concepts  introduced  in  this  section,  by  going 
through  the  exercises  in  Section  6.E.2  of  Appendix  6.E  at  the  end  of  this  chapter. 
6.3 Problems with the notion of subgame-perfect equilibrium

The  notion  of  subgame-perfect  equilibrium  is  a refinement  of  Nash 
equilibrium.  As  explained  in  Chapter  2,  in  the  context  of  perfect-information 
games,  the  notion  of  subgame-perfect  equilibrium  eliminates  some 
“unreasonable”  Nash  equilibria  that  involve  incredible  threats.  However,  not 
every  subgame-perfect  equilibrium  can  be  viewed  as  a “rational  solution”.  To 
see  this,  consider  the  extensive-form  game  shown  in  Figure  6.12  below.  This 
game  has  no  proper  subgames  and  thus  the  set  of  subgame-perfect  equilibria 
coincides  with  the  set  of  Nash  equilibria.  The  pure-strategy  Nash  equilibria  of 
this  game  are  ( a,f,c ),  (a,e,c) , ( b,e,c ) and  ( b,f,d ).  It  can  be  argued  that 
neither  ( a,f,c ) nor  ( b,f,d ) can  be  considered  “rational  solutions”. 

Consider first the Nash equilibrium (a,f,c). Player 2’s plan to play f is rational only in the very limited sense that, given that Player 1 plays a, what Player 2 plans to do is irrelevant because it cannot affect anybody’s payoff; thus f is as good as e. However, if we take Player 2’s strategy as a “serious” plan specifying what Player 2 would actually do if she had to move, then, given that Player 3 plays c, e would give Player 2 a payoff of 2, while f would only give a payoff of 1. Thus e seems to be a better strategy than f if Player 2 takes the contingency “seriously”.


Figure 6.12

An extensive-form game showing the insufficiency
of the notion of subgame-perfect equilibrium.
Consider now the Nash equilibrium (b,f,d) and focus on Player 3. As before,
Player 3’s plan to play d is rational only in the very limited sense that, given
that Player 1 plays b and Player 2 plays f, what Player 3 plans to do is
irrelevant, so that c is as good as d. However, if Player 3 did find himself having
to play, it would not be rational for him to play d, since d is a strictly
dominated choice: no matter whether he is making his choice at the left node
or at the right node of his information set, c gives him a higher payoff than d.
How can it be then that d can be part of a Nash equilibrium? The answer is
that d is strictly dominated conditional on Player 3’s information set being
reached but not as a plan formulated before the play of the game starts. In other
words, d is strictly dominated as a choice but not as a strategy.

The notion of subgame-perfect equilibrium is not strong enough to eliminate
“unreasonable” Nash equilibria such as (a,f,c) and (b,f,d) in the game of
Figure 6.12. In order to do that we will need a stronger notion. This issue is
postponed to a later chapter.
Appendix 6.E: Exercises

6.E.1. Exercises for Section 6.1:

Behavioral strategies in dynamic games

The answers to the following exercises are in Appendix 6.S at the end of this
chapter.

Exercise  6.1.  What  properties  must  an  extensive-form  frame  satisfy  in  order  for 
it  to  be  the  case  that,  for  a given  player,  the  set  of  mixed  strategies  coincides 
with  the  set  of  behavioral  strategies?  [Assume  that  there  are  at  least  two  choices 
at  every  information  set.] 

Exercise  6.2.  Suppose  that,  in  a given  extensive-form  frame,  Player  1 has  four 
information  sets:  at  one  of  them  she  has  two  choices  and  at  each  of  the  other 
three  she  has  three  choices. 

(a)  How  many  parameters  are  needed  to  specify  a mixed  strategy  of  Player  1? 

(b) How many parameters are needed to specify a behavioral strategy of Player 1?


Exercise 6.3. From the behavioral strategy profile

[behavioral strategy profile not recoverable from the source]

calculate the payoff of Player 2 in two ways: (1) using the game of Figure 6.5
and (2) using the simplified game of Figure 6.6.


Exercise 6.4. Consider the following extensive form, where $o_1,\dots,o_5$ are basic
outcomes.

Player 1’s ranking of O is $o_1 \succ_1 o_5 \succ_1 o_4 \succ_1 o_2 \sim_1 o_3$; furthermore, she is
indifferent between $o_5$ and the lottery
$\begin{pmatrix} o_1 & o_2 & o_3 \\ \frac{6}{8} & \frac{1}{8} & \frac{1}{8} \end{pmatrix}$
and is also indifferent between $o_4$ and the lottery
$\begin{pmatrix} o_2 & o_5 \\ \frac{2}{3} & \frac{1}{3} \end{pmatrix}$.

Player 2’s ranking of O is $o_1 \sim_2 o_2 \sim_2 o_4 \succ_2 o_3 \succ_2 o_5$; furthermore, he is
indifferent between $o_3$ and the lottery
$\begin{pmatrix} o_1 & o_2 & o_5 \\ \frac{1}{10} & \frac{1}{10} & \frac{8}{10} \end{pmatrix}$.

Finally, Player 3’s ranking of O is $o_2 \succ_3 o_4 \succ_3 o_3 \sim_3 o_5 \succ_3 o_1$; furthermore, she
is indifferent between $o_4$ and the lottery
$\begin{pmatrix} o_1 & o_2 & o_3 \\ \frac{1}{4} & \frac{1}{2} & \frac{1}{4} \end{pmatrix}$
and is also indifferent between $o_3$ and the lottery
$\begin{pmatrix} o_1 & o_2 \\ \frac{3}{5} & \frac{2}{5} \end{pmatrix}$.

Write the corresponding extensive-form game.


6.E.2.  Exercises  for  Section  6.2: 

Subgame-perfect  equilibrium  revisited 

The answers to the following exercises are in Appendix 6.S at the end of this
chapter.

Exercise  6.5.  Consider  the  following  extensive-form  game  with  cardinal 
payoffs. 


[Figure: game tree for Exercise 6.5, not recoverable from the source.]

(a) Write the corresponding strategic-form game and find all the pure-strategy
Nash equilibria.

(b)  Find  the  subgame-perfect  equilibrium. 

Exercise 6.6. Consider the following extensive form (where the basic outcomes
are denoted by $x_j$ instead of $o_j$, $j = 1,\dots,10$).

[Figure: game tree for Exercise 6.6, not recoverable from the source.]

All the players satisfy the axioms of expected utility. They rank the outcomes
as indicated below (if outcome w is above outcome y then w is strictly preferred
to y, and if w and y are written next to each other then the player is indifferent
between the two):

Player 1:    x7, x9
             x1, x2, x4, x5
             x10
             x3, x6, x8

Player 2:    x1, [rest of row illegible in the source]
             x4, x5
             x2, x7, x8
             x6
             [row illegible in the source]

Player 3:    x2, x7
             x8
             x1, x4, x9
             x3, x5, x6

Furthermore, Player 2 is indifferent between $x_4$ and the lottery
$\begin{pmatrix} x_1 & x_2 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}$
and Player 3 is indifferent between $x_1$ and the lottery
$\begin{pmatrix} x_2 & x_5 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}$.
Although the above information is not sufficient to determine the von Neumann-
Morgenstern utility functions of the players, it is sufficient to compute the
subgame-perfect equilibrium. Find the subgame-perfect equilibrium.
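Exercise 6.7. Consider the following extensive-form game with cardinal
payoffs.

[Figure: game tree for Exercise 6.7, not recoverable from the source. The payoffs
at the eight terminal nodes are listed below in the order in which they appear in
the figure, with Player 1’s payoff on top and Player 2’s payoff below.]

2 0 0 4 1 2 4 1
0 6 2 1 4 0 3 2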

(a)  Write  the  corresponding  strategic-form  game. 

(b)  Find  all  the  pure-strategy  Nash  equilibria. 

(c)  Find  the  mixed-strategy  subgame-perfect  equilibrium. 

Exercise  6.8.  Consider  the  following  extensive-form  game  with  cardinal 
payoffs: 


(a)  Find  all  the  pure-strategy  Nash  equilibria.  Which  ones  are  also  subgame 
perfect? 
(b) [This is a more challenging question] Prove that there is no mixed-strategy
Nash equilibrium where Player 1 plays M with probability strictly between 0
and 1.


Exercise  6.9:  Challenging  Question. 

You  have  to  go  to  a part  of  town  where  many  people  have  been  mugged 
recently.  You  consider  whether  you  should  leave  your  wallet  at  home  or  carry 
it  with  you.  Of  the  four  possible  outcomes,  your  most  preferred  one  is  having 
your  wallet  with  you  and  not  being  mugged.  Being  mugged  is  a very  unpleasant 
experience,  so  your  second  favorite  alternative  is  not  carrying  your  wallet  and 
not  being  mugged  (although  not  having  any  money  with  you  can  be  very 
inconvenient).  If,  sadly  enough,  your  destiny  is  to  be  mugged,  then  you  prefer 
to  have  your  wallet  with  you  (possibly  with  not  too  much  money  in  it!) 
because  you  don't  want  to  have  to  deal  with  a frustrated  mugger.  A typical 
potential  mugger,  on  the  other  hand,  does  not  care  whether  or  not  you  are 
carrying  a wallet,  in  case  he  decides  not  to  mug  you  (that  is,  he  is  indifferent 
between  the  corresponding  two  outcomes).  Of  course  his  favorite  outcome  is 
the  one  where  you  have  your  wallet  with  you  and  he  mugs  you.  His  least 
preferred  outcome  is  the  one  where  he  attempts  to  mug  you  and  you  don't 
have  your  wallet  with  you  (he  risks  being  caught  for  nothing).  Denote  the 
possible  outcomes  as  follows: 


                                Potential mugger
                                Not mug     Mug
You    Leave wallet at home     z1          z2
       Take wallet with you     z3          z4

(a)  What  is  the  ordinal  ranking  of  the  outcomes  for  each  player? 


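(b) Suppose now that both players have von Neumann-Morgenstern utility
functions. You are indifferent between the following lotteries:

$L_1 = \begin{pmatrix} z_1 & z_2 & z_3 & z_4 \\ \frac{3}{20} & \frac{14}{20} & \frac{3}{20} & 0 \end{pmatrix}$ and $L_2 = \begin{pmatrix} z_1 & z_2 & z_3 & z_4 \\ 0 & \frac{1}{2} & 0 & \frac{1}{2} \end{pmatrix}$;

furthermore, you are indifferent between

$L_3 = \begin{pmatrix} z_1 & z_2 & z_3 & z_4 \\ 0 & \frac{2}{3} & \frac{1}{3} & 0 \end{pmatrix}$ and $L_4 = \begin{pmatrix} z_1 & z_2 & z_3 & z_4 \\ \frac{1}{2} & \frac{1}{2} & 0 & 0 \end{pmatrix}$.

The potential mugger is indifferent between the two lotteries

$L_5 = \begin{pmatrix} z_1 & z_2 & z_3 & z_4 \\ \frac{1}{4} & \frac{1}{4} & \frac{1}{4} & \frac{1}{4} \end{pmatrix}$ and $L_6 = \begin{pmatrix} z_1 & z_2 & z_3 & z_4 \\ \frac{8}{128} & \frac{67}{128} & \frac{16}{128} & \frac{37}{128} \end{pmatrix}$.

For each player find the normalized von Neumann-Morgenstern utility function.
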
You have to decide whether or not to leave your wallet at home. Suppose that,
if  you  leave  your  wallet  at  home,  with  probability  p (with  0 < p < 1)  the 
potential  mugger  will  notice  that  your  pockets  are  empty  and  with  probability 
(1  -p)  he  will  not  notice.  If  the  potential  mugger  does  not  notice  that  your 
pockets  are  empty  then  he  will  nevertheless  be  aware  of  the  fact  that  you  might 
have  left  the  wallet  at  home  and  he  simply  cannot  tell. 

(c)  Represent  this  situation  as  an  extensive  game  with  imperfect  information. 

(d)  Write  the  corresponding  normal  form. 

(e)  Find  all  the  subgame-perfect  equilibria  (including  the  mixed-strategy  ones,  if 
any).  [Hint:  your  answer  should  distinguish  between  different  values  of  p]. 
Appendix 6.S: Solutions to exercises

Exercise  6.1.  It  must  be  the  case  that  the  player  under  consideration  has  only 
one  information  set. 

Exercise 6.2. (a) 53. The number of pure strategies is $2 \times 3 \times 3 \times 3 = 54$ and thus
53 probabilities are needed to specify a mixed strategy. (b) 7: one probability
for the information set where she has two choices and two probabilities for
each of the other three information sets.
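
The counting argument in Exercise 6.2 can be checked with a short sketch. The function names below are illustrative, not part of the text; the only input is the list of numbers of choices at each of the player's information sets.

```python
from math import prod

def mixed_strategy_parameters(choices_per_info_set):
    """Free probabilities in a mixed strategy: (number of pure strategies) - 1."""
    return prod(choices_per_info_set) - 1

def behavioral_strategy_parameters(choices_per_info_set):
    """One distribution per information set, each with (choices - 1) free probabilities."""
    return sum(n - 1 for n in choices_per_info_set)

# Player 1 of Exercise 6.2: one information set with two choices, three with three.
choices = [2, 3, 3, 3]
print(mixed_strategy_parameters(choices))       # 53
print(behavioral_strategy_parameters(choices))  # 7
```

The gap between the two counts grows quickly with the number of information sets, which is one practical reason to work with behavioral strategies.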


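Exercise 6.3. (1) The induced probability distribution on basic outcomes has
probabilities expressed over the common denominator 540 (the individual
entries are not legible in the source). Weighting Player 2’s utilities 3, 6, 4, 5
and 0 by these probabilities gives Player 2 an expected utility of 3. (2) The
induced probability distribution on terminal nodes is

$\begin{pmatrix} z_1 & z_2 & z_3 & z_4 & z_5 \\ \frac{5}{36} & \frac{10}{36} & \frac{7}{36} & \frac{4}{36} & \frac{10}{36} \end{pmatrix}$.

Thus Player 2’s expected payoff is
$\frac{5}{36}\,4 + \frac{10}{36}\,4 + \frac{7}{36}\,4 + \frac{4}{36}\,5 + \frac{10}{36}\,0 = \frac{108}{36} = 3$. Not surprisingly, the same number as in
part (1).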

Exercise 6.4. The normalized von Neumann-Morgenstern utility functions are

         o1     o2     o3     o4      o5
U1       1      0      0      0.25    0.75
U2       1      1      0.2    1       0
U3       0      1      0.4    0.6     0.4

Thus the extensive-form game is as follows:

[Figure: game tree, not recoverable from the source.]

Or, in a simplified form obtained by removing the move of Nature, as follows:

[Figure: simplified game tree, not recoverable from the source.]
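
The normalization in Exercise 6.4 can be reproduced with exact fractions. This is a minimal sketch: the lottery probabilities are the ones read from the exercise statement (an assumption where the source is garbled, chosen so that they agree with the utility table above), and `expected_utility` is a hypothetical helper name.

```python
from fractions import Fraction as F

def expected_utility(lottery, utility):
    """Expected utility of a lottery given as {outcome: probability}."""
    return sum(prob * utility[outcome] for outcome, prob in lottery.items())

# Player 1: o1 is best (utility 1), o2 and o3 are worst (utility 0).
u1 = {"o1": F(1), "o2": F(0), "o3": F(0)}
u1["o5"] = expected_utility({"o1": F(6, 8), "o2": F(1, 8), "o3": F(1, 8)}, u1)
u1["o4"] = expected_utility({"o2": F(2, 3), "o5": F(1, 3)}, u1)

# Player 2: o1, o2, o4 are best, o5 is worst.
u2 = {"o1": F(1), "o2": F(1), "o4": F(1), "o5": F(0)}
u2["o3"] = expected_utility({"o1": F(1, 10), "o2": F(1, 10), "o5": F(8, 10)}, u2)

# Player 3: o2 is best, o1 is worst, o3 and o5 are equally good.
u3 = {"o2": F(1), "o1": F(0)}
u3["o3"] = expected_utility({"o1": F(3, 5), "o2": F(2, 5)}, u3)
u3["o5"] = u3["o3"]
u3["o4"] = expected_utility({"o1": F(1, 4), "o2": F(1, 2), "o3": F(1, 4)}, u3)

print(u1["o4"], u1["o5"])  # 1/4 3/4
print(u2["o3"])            # 1/5
print(u3["o3"], u3["o4"])  # 2/5 3/5
```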


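Exercise 6.5. (a) The strategic form is as follows.

[Table not recoverable from the source: Player 1 chooses among UL, UR, DL and
DR (rows) and Player 2 chooses between l and r (columns).]

The pure-strategy Nash equilibria are (UL,r) and (UR,r).

(b) The strategic form of the proper subgame that starts at Player 2’s node is:

[Table not recoverable from the source: Player 1 chooses between L and R (rows)
and Player 2 chooses between l and r (columns).]

This game has a unique mixed-strategy Nash equilibrium given by
$\left( \begin{pmatrix} L & R \\ \frac{3}{4} & \frac{1}{4} \end{pmatrix}, \begin{pmatrix} l & r \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix} \right)$, yielding Player 1 an expected payoff of 1.5. Thus the
unique subgame-perfect equilibrium, expressed as a behavioral-strategy profile,
is

$\left( \left( \begin{array}{cc|cc} U & D & L & R \\ 0 & 1 & \frac{3}{4} & \frac{1}{4} \end{array} \right), \begin{pmatrix} l & r \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix} \right)$

or, expressed as a mixed-strategy profile,

$\left( \begin{pmatrix} UL & UR & DL & DR \\ 0 & 0 & \frac{3}{4} & \frac{1}{4} \end{pmatrix}, \begin{pmatrix} l & r \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix} \right)$.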

Exercise 6.6. There is only one proper subgame starting from Player 2’s node;
its strategic-form frame is as follows:

                   Player 3
                   d      e      f
             a     x1     x2     x3
Player 2     b     x4     x5     x6
             c     x7     x8     x9

For Player 2 strategy c is strictly dominated by strategy b (she prefers x4 to x7,
x5 to x8 and x6 to x9) and for Player 3 strategy f is strictly dominated by
strategy d (she prefers x1 to x3, x4 to x6 and x7 to x9). Thus we can simplify
the game to the following.

                   Player 3
                   d      e
Player 2     a     x1     x2
             b     x4     x5

Restricted to these outcomes the players’ rankings are as follows. Player 2: x1
(best), then x4 and x5 (indifferent), then x2 (worst). Player 3: x2 (best), then
x1 and x4 (indifferent), then x5 (worst). Let U be Player 2’s von Neumann-
Morgenstern utility function. Then we can set $U(x_1) = 1$ and $U(x_2) = 0$. Thus,
since she is indifferent between x4 and x5 and also between x4 and the lottery
$\begin{pmatrix} x_1 & x_2 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}$, $U(x_4) = U(x_5) = \frac{1}{2}$. Let V be Player 3’s von Neumann-Morgenstern
utility function. Then we can set $V(x_2) = 1$ and $V(x_5) = 0$. Thus, since she is
indifferent between x1 and x4 and also between x1 and the lottery
$\begin{pmatrix} x_2 & x_5 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}$, $V(x_1) = V(x_4) = \frac{1}{2}$. Hence the above game-frame becomes the
following game:

                   Player 3
                   d            e
Player 2     a     1, 1/2       0, 1
             b     1/2, 1/2     1/2, 0

There is no pure-strategy Nash equilibrium. Let p be the probability of a and q
the probability of d. Then for a Nash equilibrium we need $q = \frac{1}{2}$ and $p = \frac{1}{2}$.
Hence in the subgame the outcome will be the lottery
$\begin{pmatrix} x_1 & x_2 & x_4 & x_5 \\ \frac{1}{4} & \frac{1}{4} & \frac{1}{4} & \frac{1}{4} \end{pmatrix}$. Since all of these
outcomes are better than x10 for Player 1, Player 1 will play d. Thus in the
subgame-perfect equilibrium Player 1 plays d with probability 1, Player 2 plays
a and b with probability 1/2 each (and c with probability 0) and Player 3 plays
d and e with probability 1/2 each (and f with probability 0).


Exercise 6.7. (a) The strategic form is as follows:

                          Player 2
                  AC       AD       BC       BD
           LEG    2,0      2,0      0,2      0,2
           LEH    2,0      2,0      0,2      0,2
           LFG    0,6      0,6      4,1      4,1
Player 1   LFH    0,6      0,6      4,1      4,1
           REG    1,4      4,3      1,4      4,3
           REH    2,0      1,2      2,0      1,2
           RFG    1,4      4,3      1,4      4,3
           RFH    2,0      1,2      2,0      1,2

(b) There are no pure-strategy Nash equilibria.

(c) First let us solve the subgame on the left:

                   Player 2
                   A        B
Player 1     E     2,0      0,2
             F     0,6      4,1

There is no pure-strategy Nash equilibrium. Let us find the mixed-strategy
equilibrium. Let p be the probability assigned to E and q the probability
assigned to A. Then p must be the solution to $6(1-p) = 2p + (1-p)$ and q must
be the solution to $2q = 4(1-q)$. Thus $p = \frac{5}{7}$ and $q = \frac{2}{3}$. The expected payoff of
Player 1 is $\frac{4}{3} = 1.33$, while the expected payoff of Player 2 is $\frac{12}{7} = 1.714$.

Next we solve the subgame on the right:

                   Player 2
                   C        D
Player 1     G     1,4      4,3
             H     2,0      1,2

There is no pure-strategy Nash equilibrium. Let us find the mixed-strategy
equilibrium. Let p be the probability assigned to G and q the probability
assigned to C. Then p must be the solution to $4p = 3p + 2(1-p)$ and q must be
the solution to $q + 4(1-q) = 2q + (1-q)$. Thus $p = \frac{2}{3}$ and $q = \frac{3}{4}$. The expected
payoff of Player 1 is $\frac{7}{4} = 1.75$ and the expected payoff of Player 2 is $\frac{8}{3} = 2.67$.
Thus the game reduces to:

[Figure: Player 1 chooses between L, with payoffs (1.33, 1.714), and R, with
payoffs (1.75, 2.67).]

Hence the subgame-perfect equilibrium is:

$\left( \left( \begin{array}{cc|cc|cc} L & R & E & F & G & H \\ 0 & 1 & \frac{5}{7} & \frac{2}{7} & \frac{2}{3} & \frac{1}{3} \end{array} \right), \left( \begin{array}{cc|cc} A & B & C & D \\ \frac{2}{3} & \frac{1}{3} & \frac{3}{4} & \frac{1}{4} \end{array} \right) \right)$
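
The two indifference computations in Exercise 6.7(c) follow the same pattern, so they can be sketched with one helper. This is an illustration under the assumption that the 2x2 game has no pure-strategy equilibrium (so both denominators are nonzero); `mixed_2x2` is a hypothetical name, and the payoff matrices are the two subgames of the exercise.

```python
from fractions import Fraction as F

def mixed_2x2(A, B):
    """Completely mixed equilibrium of a 2x2 game.

    A[i][j] and B[i][j] are the row and column player's payoffs.
    Returns (p, q): p = probability of row 0, q = probability of column 0.
    """
    # p makes the column player indifferent between the two columns.
    p = F(B[1][1] - B[1][0], B[0][0] - B[1][0] - B[0][1] + B[1][1])
    # q makes the row player indifferent between the two rows.
    q = F(A[1][1] - A[0][1], A[0][0] - A[0][1] - A[1][0] + A[1][1])
    return p, q

def expected_payoffs(A, B, p, q):
    """Equilibrium payoffs, read off row 0 and column 0 (valid under indifference)."""
    row = q * A[0][0] + (1 - q) * A[0][1]
    col = p * B[0][0] + (1 - p) * B[1][0]
    return row, col

# Left subgame: rows E, F (Player 1); columns A, B (Player 2).
left_A, left_B = [[2, 0], [0, 4]], [[0, 2], [6, 1]]
# Right subgame: rows G, H (Player 1); columns C, D (Player 2).
right_A, right_B = [[1, 4], [2, 1]], [[4, 3], [0, 2]]

left = mixed_2x2(left_A, left_B)
right = mixed_2x2(right_A, right_B)
left_value = expected_payoffs(left_A, left_B, *left)
right_value = expected_payoffs(right_A, right_B, *right)

# Backward induction at the root: Player 1 picks the subgame with the higher payoff.
root_choice = "L" if left_value[0] > right_value[0] else "R"
print(left, right, root_choice)
```

Exact fractions avoid the rounding that appears in the text (1.33, 1.714, 2.67) while giving the same equilibrium.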

Exercise 6.8. (a) The strategic form is as follows (in each cell the first number
is Player 1’s payoff, the second Player 2’s payoff and the third Player 3’s payoff):

Player 3 chooses a
                   Player 2
                   l          r
             R     1,0,0      1,0,0
Player 1     M     0,1,1      2,2,1
             L     1,1,0      0,0,0

Player 3 chooses b
                   Player 2
                   l          r
             R     1,0,0      1,0,0
Player 1     M     0,1,0      0,0,0
             L     1,1,0      0,0,0

The pure-strategy Nash equilibria are (R,l,a), (M,r,a), (L,l,a), (R,l,b),
(R,r,b) and (L,l,b). They are all subgame perfect because there are no proper
subgames.
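
The list of pure-strategy equilibria in Exercise 6.8(a) can be confirmed by brute force. The sketch below assumes the payoff triples as read from the strategic form above (Player 1, Player 2, Player 3); the dictionary layout and the function name `is_nash` are illustrative choices.

```python
from itertools import product

S1, S2, S3 = ["R", "M", "L"], ["l", "r"], ["a", "b"]

# Payoffs (Player 1, Player 2, Player 3) for every pure-strategy profile.
payoff = {
    ("R", "l", "a"): (1, 0, 0), ("R", "r", "a"): (1, 0, 0),
    ("M", "l", "a"): (0, 1, 1), ("M", "r", "a"): (2, 2, 1),
    ("L", "l", "a"): (1, 1, 0), ("L", "r", "a"): (0, 0, 0),
    ("R", "l", "b"): (1, 0, 0), ("R", "r", "b"): (1, 0, 0),
    ("M", "l", "b"): (0, 1, 0), ("M", "r", "b"): (0, 0, 0),
    ("L", "l", "b"): (1, 1, 0), ("L", "r", "b"): (0, 0, 0),
}

def is_nash(profile):
    """True if no player gains by a unilateral deviation from the profile."""
    for i, strategies in enumerate((S1, S2, S3)):
        for s in strategies:
            deviation = list(profile)
            deviation[i] = s
            if payoff[tuple(deviation)][i] > payoff[profile][i]:
                return False
    return True

equilibria = [profile for profile in product(S1, S2, S3) if is_nash(profile)]
print(equilibria)
```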


(b) Since, for Player 3, a strictly dominates b conditional on his information set
being reached, he will have to play a if his information set is reached with
positive probability. Now, Player 3’s information set is reached with positive
probability if and only if Player 1 plays M with positive probability. Thus when
Pr(M) > 0 the game essentially reduces to the following (in each cell the first
number is Player 1’s payoff, the second Player 2’s payoff and the third Player 3’s
payoff):

                   Player 2
                   l          r
             R     1,0,0      1,0,0
Player 1     M     0,1,1      2,2,1
             L     1,1,0      0,0,0

Now, in order for Player 1 to be willing to assign positive probability to M he
must expect a payoff of at least 1 (otherwise R would be better) and the only
way he can expect a payoff of at least 1 is if Player 2 plays r with probability at
least 1/2. Now if Player 2 plays r with probability greater than 1/2, then M gives
Player 1 a higher payoff than both L and R and thus he will choose M with
probability 1, in which case Player 2 will choose r with probability 1 (and
Player 3 will choose a with probability 1) and so we get the pure-strategy
equilibrium (M,r,a). If Player 2 plays r with probability exactly 1/2 then Player 1
is indifferent between M and R (and can mix between the two), but finds L
inferior and must give it probability 0. But then Player 2’s best reply to a mixed
strategy of Player 1 that assigns positive probability to M and R and zero
probability to L is to play r with probability 1 (if his information set is reached
it can only be reached at the node that follows M). Thus there cannot be a
mixed-strategy equilibrium where Player 1 assigns to M probability p with
0 < p < 1: it must be either Pr(M) = 0 or Pr(M) = 1.


Exercise 6.9. (a) Your ranking is (best at the top, worst at the bottom)

$\begin{pmatrix} z_3 \\ z_1 \\ z_4 \\ z_2 \end{pmatrix}$

while the potential mugger’s ranking is

$\begin{pmatrix} z_4 \\ z_1, z_3 \\ z_2 \end{pmatrix}$.

(b) Let U be your utility function. Let $U(z_3) = 1$, $U(z_1) = a$, $U(z_4) = b$ and
$U(z_2) = 0$, with $0 < b < a < 1$. The expected utilities are as follows:
$EU(L_1) = \frac{3}{20}a + \frac{3}{20}$, $EU(L_2) = \frac{1}{2}b$, $EU(L_3) = \frac{1}{3}$ and $EU(L_4) = \frac{1}{2}a$. From
$EU(L_3) = EU(L_4)$ we get that $a = \frac{2}{3}$. Substituting this into the equation
$EU(L_1) = EU(L_2)$ gives $b = \frac{1}{2}$. Thus $U(z_3) = 1$, $U(z_1) = \frac{2}{3}$, $U(z_4) = \frac{1}{2}$ and
$U(z_2) = 0$. Let V be the mugger’s utility function. Let $V(z_4) = 1$, $V(z_1) = V(z_3) = c$
and $V(z_2) = 0$ with $0 < c < 1$. The expected utilities are as follows:
$EV(L_5) = \frac{1}{4}(2c + 1)$ and $EV(L_6) = \frac{1}{128}(24c + 37)$. Solving $EV(L_5) = EV(L_6)$ gives
$c = \frac{1}{8}$. Thus $V(z_4) = 1$, $V(z_1) = V(z_3) = \frac{1}{8}$ and $V(z_2) = 0$.


(c) The extensive game is as follows:

[Figure: game tree, not recoverable from the source.]


(d) The strategic form is as follows (for the mugger’s strategy the first item
refers to the left node, the second item to the information set on the right):

                                 Potential Mugger
            NN          NM                 MN                         MM
You    L    2/3, 1/8    (2/3)p, (1/8)p     (2/3)(1-p), (1/8)(1-p)     0, 0
       T    1, 1/8      1/2, 1             1, 1/8                     1/2, 1

(e) At a subgame-perfect equilibrium the mugger will choose not to mug when
he notices your empty pockets. Thus the normal form can be simplified as
follows:

                 Potential Mugger
            NN          NM
You    L    2/3, 1/8    (2/3)p, (1/8)p
       T    1, 1/8      1/2, 1

• If $p < \frac{3}{4}$ then Take is a strictly dominant strategy for you and
therefore there is a unique subgame-perfect equilibrium given by (Take,
(Not mug, Mug)).

• If $p = \frac{3}{4}$ then there is a continuum of equilibria where the Mugger
chooses (Not mug, Mug) with probability 1 and you choose L with
probability q and T with probability (1 - q) for any q with $0 \le q \le \frac{28}{29}$
(obtained from $\frac{3}{32}q + 1 - q \ge \frac{1}{8}$).

• If $p > \frac{3}{4}$ then there is no pure-strategy subgame-perfect equilibrium.
Let q be the probability that you choose L and r the probability that the
mugger chooses NN. Then the unique mixed-strategy equilibrium is
given by the solution to $\frac{2}{3}r + \frac{2}{3}p(1-r) = r + \frac{1}{2}(1-r)$ and
$\frac{1}{8} = \frac{1}{8}pq + (1 - q)$, which is $q = \frac{7}{8-p}$ and $r = \frac{4p-3}{4p-1}$. Thus the unique
subgame-perfect equilibrium is:

$\left( \begin{pmatrix} L & T \\ \frac{7}{8-p} & \frac{1-p}{8-p} \end{pmatrix}, \begin{pmatrix} NN & NM & MN & MM \\ \frac{4p-3}{4p-1} & \frac{2}{4p-1} & 0 & 0 \end{pmatrix} \right)$
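
The closed-form solution for the case p > 3/4 in Exercise 6.9(e) can be checked numerically for any particular p. The sketch below uses the simplified normal form above; the function names and the sample value p = 7/8 are assumptions made only for illustration.

```python
from fractions import Fraction as F

def simplified_game(p):
    """Payoffs (you, mugger) in the simplified normal form, for a given p."""
    return {
        ("L", "NN"): (F(2, 3), F(1, 8)),
        ("L", "NM"): (F(2, 3) * p, F(1, 8) * p),
        ("T", "NN"): (F(1), F(1, 8)),
        ("T", "NM"): (F(1, 2), F(1)),
    }

def mixed_equilibrium(p):
    """q = Pr(you choose L), r = Pr(mugger chooses NN), valid for 3/4 < p < 1."""
    q = F(7) / (8 - p)
    r = (4 * p - 3) / (4 * p - 1)
    return q, r

def both_players_indifferent(p):
    """Check the two indifference conditions that define the mixed equilibrium."""
    g = simplified_game(p)
    q, r = mixed_equilibrium(p)
    you_L = r * g[("L", "NN")][0] + (1 - r) * g[("L", "NM")][0]
    you_T = r * g[("T", "NN")][0] + (1 - r) * g[("T", "NM")][0]
    mugger_NN = q * g[("L", "NN")][1] + (1 - q) * g[("T", "NN")][1]
    mugger_NM = q * g[("L", "NM")][1] + (1 - q) * g[("T", "NM")][1]
    return you_L == you_T and mugger_NN == mugger_NM

p = F(7, 8)
print(mixed_equilibrium(p))         # (Fraction(56, 57), Fraction(1, 5))
print(both_players_indifferent(p))  # True
```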
PART III

Advanced Topics I

Knowledge,
Common Knowledge,
Belief

Chapter 7

Knowledge and
Common Knowledge

7.1  Individual  knowledge 

In  extensive-form  games  with  imperfect  information  we  use  information  sets  to 
represent  what  the  players,  when  it  is  their  turn  to  move,  know  about  past 
choices. An information set of Player i is a collection of nodes in the tree where
it is Player i’s turn to move and the interpretation is that Player i knows that she is
making her choice at one of those nodes, but she does not know which of these
nodes  has  actually  been  reached.  In  this  chapter  we  extend  the  notion  of 
information  set  to  more  general  settings. 

We  start  with  an  example.  After  listening  to  her  patient’s  symptoms,  a doctor 
reaches  the  conclusion  that  there  are  only  five  possible  causes:  (1)  a bacterial 
infection,  (2)  a viral  infection,  (3)  an  allergic  reaction  to  a drug,  (4)  an  allergic 
reaction  to  food  and  (5)  environmental  factors.  The  doctor  decides  to  do  a lab  test. 
If  the  lab  test  turns  out  to  be  positive  then  the  doctor  will  be  able  to  rule  out  causes 
(3)-(5),  while  a negative  lab  test  will  be  an  indication  that  causes  (1)  and  (2)  can  be 
ruled  out.  To  represent  the  doctor’s  possible  states  of  information  and  knowledge 
we  can  use  five  states:  a,  b,  c,  d,  and  e.  Each  state  represents  a possible  cause,  as 
shown  in  Figure  7.1.  We  can  partition  the  set  {a,b,c,d,e}  into  two  sets:  the  set 
{a,b}  , representing  the  state  of  knowledge  of  the  doctor  if  she  is  informed  that  the 
lab  test  is  positive,  and  the  set  {c,d,e} , representing  the  state  of  knowledge  of  the 
doctor  if  she  is  informed  that  the  lab  test  is  negative. 
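
[Figure 7.1: the five states and the doctor’s information partition. State a is
bacterial infection, b is viral infection, c is drug allergy, d is food allergy and
e is environment. The set {a,b} corresponds to a positive lab test and the set
{c,d,e} to a negative lab test.]

Figure 7.1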


Consider the proposition “the cause of the patient’s symptoms is either an infection
or environmental factors”. We can think of this proposition as the set of states
$\{a,b,e\}$ where the proposition is in fact true; furthermore, we can ask the question
“after receiving the result of the lab test, at which state would the doctor know the
proposition represented by the set $\{a,b,e\}$?”. If the true state is a, then, after
viewing the result of the lab test, the doctor will think that it is possible that the
state is either a or b and in either case it is indeed the case that the cause of the
patient’s symptoms is either an infection or environmental factors, so that the
doctor will know this; the same is true if the true state is b. On the other hand, if
the true state is e then the doctor will consider c, d and e as possibilities and thus will
not be able to claim that she knows that the cause of the patient’s symptoms is
either an infection or environmental factors. Thus the answer to the question “after
receiving the result of the lab test, at which state would the doctor know the
proposition represented by the set $\{a,b,e\}$?” is “at states a and b”. We can now
turn to the general definitions.

Definition 7.1. Let W be a finite set of states, where each state is to be
understood as a complete specification of the relevant facts about the world. An
information partition is a partition $\mathcal{I}$ of W (that is, a collection of subsets of W
that (1) are pairwise disjoint and (2) whose union covers the entire set W); the
elements of the partition are called information sets. For every $w \in W$ we
denote by $I(w)$ the information set that contains state w.

In the example of the doctor, $W = \{a,b,c,d,e\}$ and $\mathcal{I} = \{\{a,b\},\{c,d,e\}\}$;
furthermore, $I(a) = I(b) = \{a,b\}$ and $I(c) = I(d) = I(e) = \{c,d,e\}$.

Definition 7.2. Let W be a set of states. We will call the subsets of W events.
Let $\mathcal{I}$ be a partition of W, E an event (thus $E \subseteq W$) and $w \in W$ a state. We say
that at w the agent knows E if and only if the information set to which w
belongs is contained in E, that is, if and only if $I(w) \subseteq E$.
In the example of the doctor, where $W = \{a,b,c,d,e\}$ and $\mathcal{I} = \{\{a,b\},\{c,d,e\}\}$,
let $E = \{a,b,d,e\}$; then at a and b the doctor knows E because $I(a) =
I(b) = \{a,b\} \subseteq E$, but at d it is not true that the doctor knows E because
$I(d) = \{c,d,e\} \not\subseteq E$ (since $c \in I(d)$ but $c \notin E$) and, for the same reason, also at
c and e it is not true that the doctor knows E.

Note  that  it  is  possible  that  there  is  no  state  where  the  agent  knows  a given 
event.  In  the  doctor  example,  if  we  consider  event  F = {a,c}  then  there  is  no 
state  where  the  doctor  knows  F. 

Definition 7.3. Using Definition 7.2, we can define a knowledge operator K on
events that, given as input any event E, produces as output the event KE
defined as the set of states at which the agent knows E. Let $2^W$ denote the set of
events, that is, the set of subsets of W. Then the knowledge operator is the
function $K : 2^W \to 2^W$ defined as follows: for every $E \subseteq W$,

$KE = \{w \in W : I(w) \subseteq E\}$.

In the example of the doctor, where $W = \{a,b,c,d,e\}$ and $\mathcal{I} = \{\{a,b\},\{c,d,e\}\}$,
let $E = \{a,b,d,e\}$ and $F = \{a,c\}$; then $KE = \{a,b\}$ and $KF = \emptyset$.
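
Definition 7.3 translates almost literally into code. The following is a minimal sketch, with states represented as single characters and information sets as frozensets; the function name `knows` is an illustrative choice.

```python
def knows(partition, event):
    """Return KE: the states whose information set is contained in the event."""
    event = frozenset(event)
    return frozenset(w for cell in partition if cell <= event for w in cell)

# The doctor example: {a, b} is a positive lab test, {c, d, e} a negative one.
doctor = [frozenset("ab"), frozenset("cde")]

E = frozenset("abde")
F = frozenset("ac")
print(sorted(knows(doctor, E)))  # ['a', 'b']
print(sorted(knows(doctor, F)))  # []
```

Because every state in an information set shares that set, it is enough to test each cell once rather than each state.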

Given an event $G \subseteq W$ we denote by $\neg G$ the complement of G, that is, the set
of states that are not in G. For example, if $W = \{a,b,c,d,e\}$ and $G = \{a,b,d\}$
then $\neg G = \{c,e\}$. Thus while $KG$ is the event that the agent knows G, $\neg KG$ is
the event that the agent does not know G. Note the important difference
between event $\neg KG$ and event $K\neg G$: if $w \in \neg KG$ then at state w the agent
does not know G but she might not know $\neg G$ either, that is, it may be that
she considers G possible ($I(w) \cap G \neq \emptyset$) and she also considers $\neg G$ possible
($I(w) \cap \neg G \neq \emptyset$).² On the other hand, if $w \in K\neg G$ then at state w the agent
knows that G is not true, because every state that she considers possible is in
$\neg G$ ($I(w) \subseteq \neg G$). Thus $K\neg G \subseteq \neg KG$ but the converse inclusion does not
hold. In the example of the doctor, where $W = \{a,b,c,d,e\}$ and
$\mathcal{I} = \{\{a,b\},\{c,d,e\}\}$, again let $E = \{a,b,d,e\}$ (so that $\neg E = \{c\}$) and $F = \{a,c\}$
(so that $\neg F = \{b,d,e\}$); then $KE = \{a,b\}$, $\neg KE = \{c,d,e\}$, $K\neg E = \emptyset$, $KF = \emptyset$,
$\neg KF = W$ and $K\neg F = \emptyset$.

1 If W contains n elements, then there are $2^n$ subsets of W, hence the notation $2^W$. For
example, if $W = \{a,b,c\}$ then $2^W = \{\emptyset,\{a\},\{b\},\{c\},\{a,b\},\{a,c\},\{b,c\},\{a,b,c\}\}$.

2 In the example of the doctor, where $W = \{a,b,c,d,e\}$ and $\mathcal{I} = \{\{a,b\},\{c,d,e\}\}$, if $F = \{a,c\}$ (so
that $\neg F = \{b,d,e\}$) then $KF = \emptyset$ and $K\neg F = \emptyset$; for instance, if the true state is a then the
doctor considers F possible (because her information set is $I(a) = \{a,b\}$ and
$I(a) \cap F = \{a\} \neq \emptyset$) but she also considers $\neg F$ possible (because $I(a) \cap \neg F = \{b\} \neq \emptyset$).

Note the interesting fact that, since we can apply the knowledge operator to
any event, we can also compute the event that the agent knows that she knows
E (that is, the event $K(KE)$, which we will denote more simply as $KKE$) and
the event that the agent knows that she does not know E (that is, the event
$K\neg KE$). Continuing the example of the doctor, where $W = \{a,b,c,d,e\}$,
$\mathcal{I} = \{\{a,b\},\{c,d,e\}\}$ and $E = \{a,b,d,e\}$, $KKE$ is the set of states where the
agent knows event $KE = \{a,b\}$; thus $KKE = \{a,b\}$. Furthermore, since
$\neg KE = \{c,d,e\}$, $K\neg KE = \{c,d,e\}$. As noted in the following remark, this is not
a coincidence.

Remark 7.1. The knowledge operator $K : 2^W \to 2^W$ satisfies the following
properties (which you are asked to prove in Exercise 7.5): for every event
$E \subseteq W$,

• $KE \subseteq E$

• $KKE = KE$

• $K\neg KE = \neg KE$

This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this  section,  by  going  through  the  exercises  in  Section  7.E.1  of  Appendix  7.E  at  the 
end  of  this  chapter. 
7.2 Interactive knowledge

We  can  now  extend  our  analysis  to  the  case  of  several  agents  and  talk  about  not 
only  what  an  individual  knows  about  relevant  facts  but  also  about  what  she 
knows  about  what  other  individuals  know  and  what  they  know  about  what  she 
knows, etc. There is an entertaining episode of the TV series Friends in which
Phoebe and Rachel reason about whether Chandler and Monica know that
they (= Phoebe and Rachel) know that Chandler and Monica are having an
affair. Search for the string “Friends-They Don't Know That We Know They
Know We Know” on www.youtube.com or point your browser to
https://www.youtube.com/watch?v=LUN2YNObOi8

Again we start with a set of states W, where each state represents a complete
description of the relevant facts. Let there be n individuals. To represent the
possible states of mind of each individual we use an information partition: $\mathcal{I}_i$
denotes the partition of individual $i \in \{1,\dots,n\}$. As before, we call the subsets of
W events. Using Definition 7.3 we can define a knowledge operator for every
individual. Let $K_i$ be the knowledge operator of individual i; thus, for every
event $E \subseteq W$, $K_iE = \{w \in W : I_i(w) \subseteq E\}$. Now consider an event E and an
individual, say Individual 1; since $K_1E$ is an event (it is the set of states where
Individual 1 knows event E), we can compute the event $K_2K_1E$, which is the
event that Individual 2 knows event $K_1E$, that is, the event that 2 knows that 1
knows E. But there is no need to stop there: we can also compute the event
$K_3K_2K_1E$ (the event that 3 knows that 2 knows that 1 knows E) and the event
$K_1K_3K_2K_1E$ (the event that 1 knows that 3 knows that 2 knows that 1 knows
E), etc. A few examples will be useful.

We  begin  with  an  abstract  example,  without  interpreting  the  states  in  terms  of 
specific  facts.  Let  the  set  of  states  be  W = {a,  b,  c,  d,  e,  f,  g,  h}.  There  are  three 
individuals,  Ann,  Bob  and  Carol,  with  the  information  partitions  shown  in  Figure 
7.2. 




GAME  THEORY  - Giacomo  Bonanno 


[Figure: the information partitions of Ann, Bob and Carol on the set of states W = {a, b, c, d, e, f, g, h}]

Figure 7.2

Consider the event $E = \{a, b, c, f, g\}$. Let us compute the following events: $K_{Ann}E$
(the event that Ann knows E), $K_{Bob}E$, $K_{Carol}E$, $K_{Carol}K_{Ann}E$ (the event that Carol
knows that Ann knows E), $K_{Bob}K_{Carol}K_{Ann}E$ and $K_{Ann}\neg K_{Bob}K_{Carol}E$ (the event that
Ann knows that Bob does not know that Carol knows E). All we need to do is
apply Definition 7.3. First of all,

• $K_{Ann}E = \{a, b, c\}$

(for example, $b \in K_{Ann}E$ because $I_{Ann}(b) = \{b, c\}$ and $\{b, c\} \subseteq E$, while $f \notin K_{Ann}E$
because $I_{Ann}(f) = \{e, f, g\}$ and $\{e, f, g\}$ is not a subset of E). Similarly,

• $K_{Bob}E = \{a, b, f\}$, and

• $K_{Carol}E = \{b, c, f, g\}$.

To compute $K_{Carol}K_{Ann}E$ we need to find the set of states where Carol knows event
$\{a, b, c\}$, since $K_{Ann}E = \{a, b, c\}$. Thus

• $K_{Carol}K_{Ann}E = K_{Carol}\{a, b, c\} = \{b, c\}$.

Hence $K_{Bob}K_{Carol}K_{Ann}E = K_{Bob}\{b, c\} = \emptyset$, that is, there is no state where Bob knows that Carol
knows that Ann knows E. To compute $K_{Ann}\neg K_{Bob}K_{Carol}E$ first we start with
$K_{Carol}E$, which we have already computed: $K_{Carol}E = \{b, c, f, g\}$; then we compute
$K_{Bob}K_{Carol}E$, which is $\{f\}$: $K_{Bob}K_{Carol}E = K_{Bob}\{b, c, f, g\} = \{f\}$; then we take the complement of
this: $\neg K_{Bob}K_{Carol}E = \{a, b, c, d, e, g, h\}$ and finally we compute $K_{Ann}$ of this event:

• $K_{Ann}\neg K_{Bob}K_{Carol}E = K_{Ann}\{a, b, c, d, e, g, h\} = \{a, b, c, d, h\}$.

Thus, for example, at state a it is true that Ann knows that Bob does not know that
Carol knows E, while at state e this is not true.
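The computations above can be checked mechanically. The following is a minimal Python sketch, not part of the original text. The helper names `cell` and `K` are hypothetical. The three partitions are an assumption: they are a reconstruction chosen to agree with every value quoted above (for instance $I_{Ann}(b) = \{b, c\}$ and $I_{Ann}(f) = \{e, f, g\}$), and the way Carol groups the states e and h is a guess that does not affect any of the events computed here.

```python
def cell(partition, w):
    """Return the information set of `partition` that contains state w."""
    return next(c for c in partition if w in c)


def K(partition, event, states):
    """Knowledge operator: the states whose information set lies inside `event`."""
    return {w for w in states if cell(partition, w) <= event}


W = set("abcdefgh")

# Assumed partitions (reconstructed from the values reported in the text).
ann = [{"a"}, {"b", "c"}, {"d"}, {"e", "f", "g"}, {"h"}]
bob = [{"a", "b"}, {"c", "d"}, {"e"}, {"f"}, {"g", "h"}]
carol = [{"a", "d"}, {"b", "c"}, {"e", "h"}, {"f", "g"}]  # grouping of e, h is a guess

E = set("abcfg")

k_ann = K(ann, E, W)
k_bob = K(bob, E, W)
k_carol = K(carol, E, W)
k_carol_ann = K(carol, k_ann, W)
k_bob_carol_ann = K(bob, k_carol_ann, W)
# "Ann knows that Bob does not know that Carol knows E":
# take the complement of K_Bob K_Carol E, then apply K_Ann.
k_ann_not_bob_carol = K(ann, W - K(bob, k_carol, W), W)

for name, event in [
    ("K_Ann E", k_ann),
    ("K_Bob E", k_bob),
    ("K_Carol E", k_carol),
    ("K_Carol K_Ann E", k_carol_ann),
    ("K_Bob K_Carol K_Ann E", k_bob_carol_ann),
    ("K_Ann not K_Bob K_Carol E", k_ann_not_bob_carol),
]:
    print(name, "=", sorted(event))
```

Representing events and information sets as Python sets makes the subset test in Definition 7.3 a single `<=` comparison, and nested knowledge is just repeated application of `K`.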

Next  we  discuss  a concrete  example.  The  professor  in  a game  theory  class  calls  three 
students  to  the  front  of  the  room,  shows  them  a large  transparent  box  that  contains 
many  hats,  some  red  and  some  white  (there  are  no  other  colors)  and  tells  them  the 
following: 


“I  will  blindfold  you  and  put  a hat  on  each  of  you,  then  I will 
remove  the  box  from  the  room  and,  after  that,  I will  remove 
your  blindfolds,  so  that  each  of  you  can  see  the  color(s)  of  the 
hats  worn  by  the  other  two  students,  but  you  will  not  be  able 
to  see  the  color  of  your  own  hat.  Then  I will  ask  you 
questions  starting  with  Student  1 then  Student  2 then  Student 
3 then back to Student 1 and so on.”

After having placed the hats and removed the blindfolds, the professor asks Student
1 “do you know the color of your own hat?”. She replies “No”. Then he asks
Student  2 the  same  question:  “do  you  know  the  color  of  your  own  hat?”.  Student  2 
says  “No”.  Then  he  asks  Student  3 and  she  says  “No”.  Then  he  asks  the  same 
question  again  to  Student  1 and  again  the  answer  is  “No”,  and  so  on.  After  asking 
the  same  question  over  and  over  and  always  hearing  the  answer  “No”  he  gets  tired 
and  tells  them  “I’ll  give  you  a piece  of  information:  I did  not  pick  three  white 
hats”. He then resumes the questioning: first he asks Student 1 “do you know the
color  of  your  own  hat?”.  She  replies  “No”.  Then  he  asks  Student  2 the  same 
question:  “do  you  know  the  color  of  your  own  hat?”.  Student  2 says  “No”.  Then  he 
asks  Student  3 and  she  says  “Yes  I do!”  What  color  hat  does  she  have?  What  color 
hats  do  Students  1 and  2 have? 

To answer these questions, we begin by defining the set of possible states. We can
think of a state as a triple $(C_1, C_2, C_3)$, where $C_i \in \{R, W\}$ is the color of the hat of
Student i (R means Red and W means White). Thus, for example, (R,W,R) is the
state where Students 1 and 3 have a red hat, while Student 2 has a white hat. The
possible states of information of the three students before the professor announces that
he did not pick three white hats are represented by the information partitions shown
in Figure 7.3, where we have connected with a line states that are in the same
information set. Whatever the state (that is, whatever hats the professor picks), each
student is only uncertain about the color of her own hat: she can see, and thus
knows, the colors of the hats of the other two students. Thus each information set
contains only two elements.


Student 1: {(W,W,W), (R,W,W)}  {(W,W,R), (R,W,R)}  {(W,R,W), (R,R,W)}  {(W,R,R), (R,R,R)}

Student 2: {(W,W,W), (W,R,W)}  {(W,W,R), (W,R,R)}  {(R,R,R), (R,W,R)}  {(R,R,W), (R,W,W)}

Student 3: {(W,W,W), (W,W,R)}  {(W,R,W), (W,R,R)}  {(R,R,R), (R,R,W)}  {(R,W,R), (R,W,W)}

Figure 7.3 (states in the same information set are grouped in braces)


Consider a particular state, say the state where all three hats are red: (R,R,R). At
that state, obviously, each student knows that not all hats are white: he/she can
actually see two red hats. Furthermore, each student knows that every other
student knows that not all hats are white. Take, for example, Student 1. She sees
that the hats of the other two students are red and thus she reasons that Student 2
also sees that the hat of Student 3 is red and hence Student 2 knows that not all hats
are white (similarly, she reasons that Student 3 knows that not all hats are white).
But does Student 1 know that Student 2 knows that Student 3 knows that not all
hats are white? The answer is No. Seeing two red hats, Student 1 must consider it
possible that her own hat is white, in which case Student 2 would, like Student 1,
see a red hat on Student 3 but (unlike Student 1) would also see a white hat on
Student 1; thus Student 2 would have to consider the possibility that her own hat is
white, in which case, putting herself in the shoes of Student 3, she would reason that
Student 3 would see two white hats and consider it possible that his own hat was
also white, that is, consider it possible that all hats were white. We can see this more
clearly by using the information partitions and the associated knowledge operators.
To simplify matters, let us assign names to the states and rewrite the information
partitions using these names, as shown in Figure 7.4.

Footnote: It is unfortunate that many people would use, incorrectly, the expression “all hats are not
white” to mean that “it is not the case that all hats are white”. The latter expression is
equivalent to “at least one hat is red (possibly one, possibly two, possibly all three)”, while
the former is equivalent to “every hat is not white”, that is, in this context, “every hat is red”.
[It is also unfortunate that when asked “how are you?” most people answer “I am good”,
instead of “I am well” or “I am fine”!]


State:  (W,W,W)  (W,W,R)  (W,R,W)  (W,R,R)  (R,R,R)  (R,R,W)  (R,W,R)  (R,W,W)
Name:      a        b        c        d        e        f        g        h

Student 1’s partition: {{a,h}, {b,g}, {c,f}, {d,e}}

Student 2’s partition: {{a,c}, {b,d}, {e,g}, {f,h}}

Student 3’s partition: {{a,b}, {c,d}, {e,f}, {g,h}}

Figure 7.4

The proposition “not all hats are white” corresponds to event
$E = \{b, c, d, e, f, g, h\}$ (the set of all states, excluding only state a). Using Definition
7.3 we get the following events:

$K_1E = \{b, c, d, e, f, g\}$,

$K_2E = \{b, d, e, f, g, h\}$,

$K_3E = \{c, d, e, f, g, h\}$,

$K_1K_2E = K_2K_1E = \{b, d, e, g\}$,

$K_1K_3E = K_3K_1E = \{c, d, e, f\}$,

$K_2K_3E = K_3K_2E = \{e, f, g, h\}$.

Note that the intersection of all these events is the singleton set $\{e\}$. Thus at state e
(where all the hats are red), and only at state e, everybody knows that not all hats
are white and everybody knows that everybody knows that not all hats are white.
Proceeding one step further, we have that

$K_1K_2K_3E = K_1K_3K_2E = K_2K_1K_3E = K_2K_3K_1E = K_3K_1K_2E = K_3K_2K_1E = \emptyset$.

Thus there is no state (not even e) where a student knows that another student
knows that the third student knows that not all hats are white.
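The first-, second- and third-order knowledge events listed above can be reproduced with a short Python sketch. It is only an illustration: the function names `info_set` and `K` are hypothetical, students are indexed 0, 1, 2 instead of 1, 2, 3, and states are written as hat triples rather than with the names a, ..., h.

```python
from itertools import permutations, product

# Every state is a triple (C1, C2, C3) of hat colors.
U = set(product("WR", repeat=3))


def info_set(i, w, universe):
    """States student i cannot tell apart from w: same hats on the other two."""
    return {v for v in universe if all(v[j] == w[j] for j in range(3) if j != i)}


def K(i, event, universe):
    """Knowledge operator of student i (Definition 7.3)."""
    return {w for w in universe if info_set(i, w, universe) <= event}


# E: "not all hats are white".
E = U - {("W", "W", "W")}

first = [K(i, E, U) for i in range(3)]
second = [K(i, K(j, E, U), U) for i, j in permutations(range(3), 2)]
third = [K(i, K(j, K(k, E, U), U), U) for i, j, k in permutations(range(3), 3)]

# States where everybody knows E and everybody knows that everybody knows E.
everyone = set.intersection(*first, *second)

print("first and second order hold at:", everyone)
print("third order events:", third)
```

The intersection of the first- and second-order events contains only the state (R,R,R), while all six third-order events are empty, matching the values in the text.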

Let  us  now  continue  with  the  formal  analysis  of  the  story  of  the  three  hats.  At 
some  stage  the  professor  makes  the  public  announcement  “I  did  not  choose  three 
white  hats”  (that  is,  he  announces  event  E = {b,c,d,e,f,g,h}).  This 
announcement  makes  it  commonly  known  that  state  a is  to  be  ruled  out.  Thus, 
after  the  announcement,  the  information  partitions  are  reduced  to  the  ones  shown 
in Figure 7.5, obtained from Figure 7.3 by deleting the state (W,W,W).

Student 1: {(R,W,W)}  {(W,W,R), (R,W,R)}  {(W,R,W), (R,R,W)}  {(W,R,R), (R,R,R)}

Student 2: {(W,R,W)}  {(W,W,R), (W,R,R)}  {(R,R,R), (R,W,R)}  {(R,R,W), (R,W,W)}

Student 3: {(W,W,R)}  {(W,R,W), (W,R,R)}  {(R,R,R), (R,R,W)}  {(R,W,R), (R,W,W)}

Figure 7.5


Note that, at this stage, if the true state is (R,W,W) Student 1 knows that her hat is
red (she sees two white hats and, having been informed that the professor did not
choose three white hats, she can deduce that hers must be red). Similarly, if the true
state is (W,R,W), Student 2 knows that her own hat is red and, if the true state is
(W,W,R), Student 3 knows that her own hat is red. According to the story, after
announcing that not all hats are white, the professor first asks Student 1 if she
knows the color of her hat and she answers No. From this answer everybody can
deduce that the state is not (R,W,W) and thus this state can be deleted and the
information partitions reduce to the ones shown in Figure 7.6.
Student 1: {(W,W,R), (R,W,R)}  {(W,R,W), (R,R,W)}  {(W,R,R), (R,R,R)}

Student 2: {(W,R,W)}  {(R,R,W)}  {(W,W,R), (W,R,R)}  {(R,R,R), (R,W,R)}

Student 3: {(W,W,R)}  {(W,R,W), (W,R,R)}  {(R,R,R), (R,R,W)}  {(R,W,R)}

Figure 7.6


Now,  according  to  the  story,  Student  2 is  asked  whether  she  knows  the  color  of 
her  hat.  Let  us  see  what  the  possibilities  are. 

1. Student 2 will answer Yes if the state is either (W,R,W) (in fact, in this
case, she knew even before hearing Student 1’s answer) or (R,R,W) (in
this case, before hearing Student 1’s answer, she thought the state might
be either (R,R,W) or (R,W,W) but then, after hearing Student 1’s
answer, she was able to eliminate (R,W,W) as a possibility). In either of
these two states Student 2’s hat is red and thus she knows that her hat is
red. Furthermore, upon hearing Student 2 say Yes, Student 3 learns that
the state is either (W,R,W) or (R,R,W) and in both of these states his
hat is white, thus he acquires the knowledge that his own hat is white.

2. In each of the remaining states, namely (W,W,R), (W,R,R), (R,R,R)
and (R,W,R), Student 2 will answer No. Then everybody learns that
the state is neither (W,R,W) nor (R,R,W) and thus the information
partitions reduce to the ones shown in Figure 7.7.
Student 1: {(W,W,R), (R,W,R)}  {(W,R,R), (R,R,R)}

Student 2: {(W,W,R), (W,R,R)}  {(R,R,R), (R,W,R)}

Student 3: {(W,W,R)}  {(W,R,R)}  {(R,R,R)}  {(R,W,R)}

Figure 7.7

Note  that  each  of  the  remaining  states  is  now  a singleton  information 
set  for  Student  3 and  thus,  upon  hearing  Student  2 say  No,  he  learns 
what  the  state  is:  in  particular  he  learns  that  his  own  hat  is  red  (at  each 
of  these  states  Student  3’s  hat  is  red).  In  the  original  story,  Students  1 
and  2 answered  No  and  Student  3 answered  Yes  and  thus  we  fall  in  this 
second  case. 

Now  consider  a blindfolded  witness,  who  cannot  see  the  color  of  anybody’s 
hat,  but  knows  that  they  are  either  red  or  white  and  hears  the  professor’s 
announcement  and  subsequent  questions,  as  well  as  the  answers  by  the  three 
students.  What  would  the  blindfolded  witness  learn  about  the  true  state?  The 
initial information set of the witness would consist of the set of all states; then,
(1) after the announcement of the professor, he would be able to eliminate state
(W,W,W), (2) after the negative answer of Student 1 he would be able to
eliminate state (R,W,W) and (3) after the negative answer of Student 2 he
would be able to eliminate states (W,R,W) and (R,R,W). The affirmative
answer of Student 3 is not informative, because in each of the remaining states Student 3
knows that the color of her own hat is red. Thus at the end of the exchange the
witness’s information set is

{(W,W,R), (W,R,R), (R,R,R), (R,W,R)}.

Hence  all  the  witness  knows  is  that  the  hat  of  Student  3 is  red  (on  the  other 
hand,  Student  3 knows  the  color  of  all  the  hats,  because  she  has  figured  out  the 
color  of  her  own  hat  and  can  see  the  hats  of  the  other  two  students). 
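The successive eliminations in the story can be simulated directly. The sketch below is a hypothetical Python illustration (the names `knows_own_hat` and `update` are not from the text, and students are indexed 0, 1, 2): each publicly heard answer keeps exactly the states in which that answer would have been given.

```python
from itertools import product


def info_set(i, w, universe):
    """States student i considers possible at w: same hats on the other two."""
    return {v for v in universe if all(v[j] == w[j] for j in range(3) if j != i)}


def knows_own_hat(i, w, universe):
    """Student i knows her color if it is the same in every state she considers possible."""
    return len({v[i] for v in info_set(i, w, universe)}) == 1


def update(universe, i, answer_yes):
    """Keep the states in which student i would have given the observed answer."""
    return {w for w in universe if knows_own_hat(i, w, universe) == answer_yes}


U = set(product("WR", repeat=3))

# Before the announcement, a round of "No" answers rules out nothing.
before = U
for i in (0, 1, 2):
    before = update(before, i, answer_yes=False)

# After the public announcement "not three white hats":
after = U - {("W", "W", "W")}
after = update(after, 0, answer_yes=False)    # Student 1 says No
after = update(after, 1, answer_yes=False)    # Student 2 says No
witness = update(after, 2, answer_yes=True)   # Student 3 says Yes

print(sorted(witness))
```

The set `witness` is what an observer who sees no hats is left with: four states, in all of which Student 3's hat is red.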
Let us now focus on state (R,R,R) where all hats are red. Initially, before the
professor makes the announcement, no student is ever able to figure out the color
of her own hat, no matter how many times the students are asked. However, as we
saw, once the professor announces that not all hats are white, then after Students 1
and 2 reply negatively to the question whether they know the color of their own
hat, Student 3 is able to deduce that her hat is red. Thus the professor’s
announcement provides crucial information. This, however, seems puzzling,
because the professor merely tells the students what they already knew: each student,
seeing two red hats, knows that not all hats are white (furthermore, as we saw
above, each student also knows that every other student knows this). So how can
giving the students a piece of information that they already possess make any
difference? The answer is that the professor’s public announcement makes it a
matter of common knowledge that not all hats are white. Indeed, we saw above that
- at the beginning - if the true state is (R,R,R), although everybody knows that
not all hats are white and also everybody knows that everybody knows that not all
hats are white, it is not the case that Student 1 knows that Student 2 knows that
Student 3 knows that not all hats are white. Thus it is not common knowledge that
not all hats are white. The notion of common knowledge is discussed in the next
section.

This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this  section,  by  going  through  the  exercises  in  Section  7.E.2  of  Appendix  7.E  at  the 
end  of  this  chapter. 


7.3  Common  knowledge 

Common  knowledge  is  the  strongest  form  of  interactive  knowledge:  an  event  E is 
common  knowledge  if  everybody  knows  E and  everybody  knows  that  everybody 
knows  E and  everybody  knows  that  everybody  knows  that  everybody  knows  E, 
and  so  on.  For  example,  in  the  case  of  two  individuals,  we  say  that  at  state  w event 
E is  common  knowledge  if 

$w \in K_1E \cap K_2E \cap K_1K_2E \cap K_2K_1E \cap K_1K_2K_1E \cap K_2K_1K_2E \cap \dots$

We denote by CKE the event that (that is, the set of states where) event E is
common knowledge. Thus, in the case of two individuals,

$CKE = K_1E \cap K_2E \cap K_1K_2E \cap K_2K_1E \cap K_1K_2K_1E \cap K_2K_1K_2E \cap \dots$
Given the definition of common knowledge, it may seem impossible to check if an
event  is  common  knowledge,  because  it  requires  checking  an  infinite  number  of 
conditions.  We  will  see  that,  on  the  contrary,  it  is  very  easy  to  determine  if  an  event 
is  common  knowledge  at  any  state.  We  begin  with  an  example. 

Example  7.1.  Abby  proposes  the  following  to  Bruno  and  Caroline. 

“Tomorrow  I will  put  you  in  two  separate  rooms,  so 
that  there  will  be  no  possibility  of  communication 
between  you.  I will  then  pick  randomly  an  even 
number  from  the  set  {2,4,6}.  Let  n be  that  number. 

Then I will write the number n − 1 on a piece of paper
and  the  number  n + 1 on  another  piece  of  paper,  shuffle 
the  two  pieces  of  paper  and  hand  one  to  Bruno  and  the 
other  to  Caroline.  For  example,  if  I happen  to  pick  the 
number  6,  then  I will  write  5 on  a piece  of  paper  and  7 
on  another  piece  of  paper,  shuffle  and  give  one  piece  of 
paper  to  each  of  you.  After  seeing  the  number  handed 
to  you,  each  of  you  will  then  write  a pair  of  numbers 
on  your  piece  of  paper  and  return  it  to  me.  If 
(1)  you  write  the  same  pair  of  numbers  and  (2)  at  least 
one  of  the  two  numbers  is  equal  to  the  number  that  was 
actually  given  to  Bruno  then  I will  give  $1,000  to  each 
of  you,  otherwise  each  of  you  will  give  me  $1,000.” 

Should Bruno and Caroline accept the offer to play this game? They can agree today on how
they  should  act  tomorrow  under  various  contingencies,  bearing  in  mind  that  they 
will  be  unable  to  communicate  with  each  other  tomorrow. 

We  will  see  that  Bruno  and  Caroline  should  indeed  accept,  because  they  have  a 
strategy  that  guarantees  that  they  will  each  get  $1,000  from  Abby.  The  first  step  is  to 
represent  the  set  of  possible  states  and  the  information  partitions.  We  will  describe  a 
state  by  a triple  abc,  where  a is  the  number  picked  by  Abby,  b is  the  number  given 
to  Bruno  and  c is  the  number  given  to  Caroline.  Bruno  only  observes  b and 
Caroline  only  observes  c.  Thus  the  information  partitions  are  as  shown  in  Figure 
7.8. 
Bruno:    {213}  {231, 435}  {453, 657}  {675}

Caroline: {213, 453}  {231}  {435, 675}  {657}

Figure 7.8


Let us see if the following would be a successful strategy for Bruno and Caroline: “if
Bruno gets a 1 or a 3 we will write the pair (1,3) and if Bruno gets a 5 or a 7 we will
write the pair (5,7)”. Consider the event “Bruno gets a 1 or a 3”; call this event E.
Then $E = \{213, 231, 435\}$. If this event occurs (e.g. because the actual state is 213),
will it be common knowledge between Bruno and Caroline that it occurred? It is
straightforward to check that (B stands for Bruno and C for Caroline) $K_BE = E$,
$K_CE = \{231\}$ and thus $K_BK_CE = \emptyset$. Hence, while Bruno, of course, will know if
he gets a 1 or a 3, Caroline might know (if the state is 231) or she might not know
(that happens if the state is 213 or 435), but Bruno will never know that Caroline
knows. Thus event E is far from being common knowledge, if it occurs. It is easy to
check that the same is true of the event “Bruno gets a 5 or a 7”. In order to be
successful, a coordination strategy must be based on events that, when they occur,
are commonly known.


Let us now consider an alternative strategy, namely

“if Bruno gets a 1 or a 5 let us write the pair (1,5)
and if Bruno gets a 3 or a 7 let us write the pair (3,7)”     (♦)

Let F be the event “Bruno gets a 1 or a 5”, that is, $F = \{213, 453, 657\}$. Then
$K_BF = F$ and $K_CF = F$, so that $K_BK_CF = K_BF = F$, $K_CK_BF = K_CF = F$, $K_BK_CK_BF = F$,
$K_CK_BK_CF = F$ and so on. Hence $CKF = F$, that is, if event F occurs then it is
common knowledge between Bruno and Caroline that it occurred. Similarly,
letting $G = \{231, 435, 675\}$ be the event “Bruno gets a 3 or a 7” we have that
$CKG = G$. Hence strategy (♦) will be a successful coordination strategy, since the
conditioning events, when they occur, are common knowledge between Bruno and
Caroline.
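The contrast between the two strategies can be checked with a few lines of Python. This is a minimal sketch under the encoding used in the example: a state is a triple (a, b, c) and each player observes only one coordinate. The function name `K` and the constants `BRUNO` and `CAROLINE` are hypothetical.

```python
# States are triples (a, b, c): Abby's pick, Bruno's number, Caroline's number.
states = {
    (n, b, c)
    for n in (2, 4, 6)
    for b, c in ((n - 1, n + 1), (n + 1, n - 1))
}

BRUNO, CAROLINE = 1, 2  # index of the coordinate each player observes


def K(observed, event):
    """States where the player who sees coordinate `observed` knows `event`."""
    return {
        w for w in states
        if all(v in event for v in states if v[observed] == w[observed])
    }


E = {w for w in states if w[BRUNO] in (1, 3)}  # first strategy's conditioning event
F = {w for w in states if w[BRUNO] in (1, 5)}  # strategy (♦), first event
G = states - F                                 # strategy (♦), second event

print("K_B E == E:", K(BRUNO, E) == E)
print("K_C E:", K(CAROLINE, E))
print("K_B K_C E:", K(BRUNO, K(CAROLINE, E)))
print("F is known by both whenever it occurs:",
      K(BRUNO, F) == F and K(CAROLINE, F) == F)
```

Because both `K(BRUNO, F)` and `K(CAROLINE, F)` return F itself, applying the operators in any order any number of times keeps returning F, which is why F is common knowledge whenever it occurs.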

In the above example we were able to show directly that an event was common
knowledge at a given state (for example, we showed that for every $w \in F$,
$w \in CKF$). We now show a faster method for computing, for every event E, the
event CKE. The crucial step is to derive from the individuals’ information
partitions a new partition of the set of states which we call the common knowledge
partition.


Definition 7.4. Consider a set of states W and n partitions $\mathcal{I}_1, \mathcal{I}_2, \dots, \mathcal{I}_n$ of W.
As usual, if w is a state, we denote by $I_i(w)$ the element (information set) of the
partition $\mathcal{I}_i$ that contains w. Given two states $w, w' \in W$, we say that

• w' is reachable from w in one step if there exists an $i \in \{1, \dots, n\}$ such that
$w' \in I_i(w)$.

• w' is reachable from w in two steps if there exists a state $x \in W$ such that
x is reachable from w in one step and w' is reachable from x in one
step.⁴

• In general, w' is reachable from w in m steps ($m \geq 1$) if there is a sequence
of states $(w_1, w_2, \dots, w_m)$ such that (1) $w_1 = w$, (2) $w_m = w'$ and (3) for
every $k = 2, \dots, m$, $w_k$ is reachable from $w_{k-1}$ in one step.

Finally, we say that w' is reachable from w if, for some $m \geq 1$, w' is reachable
from w in m steps.


⁴ Thus w' is reachable from w in two steps if there exist $x \in W$ and $i, j \in \{1, \dots, n\}$ such that
$x \in I_i(w)$ and $w' \in I_j(x)$.
Definition 7.5. Consider a set of states W and n partitions $\mathcal{I}_1, \dots, \mathcal{I}_n$ of W.
Given a state w, the common knowledge information set that contains w, denoted by
$I_{CK}(w)$, is the set of states reachable from w. The common knowledge information
partition is the collection of common knowledge information sets.
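Reachability as defined above is a graph search: start from w and keep adding every state that shares an information set with a state already found. The following Python sketch uses breadth-first search; the function names are hypothetical, and the partitions are those of Bruno and Caroline from Example 7.1 (Bruno observes the middle digit of a state, Caroline the last).

```python
from collections import deque


def cell(partition, w):
    """Return the information set of `partition` that contains state w."""
    return next(c for c in partition if w in c)


def reachable(w, partitions):
    """All states reachable from w in one or more steps (Definition 7.4)."""
    seen, queue = {w}, deque([w])   # w is reachable from itself in one step
    while queue:
        x = queue.popleft()
        for p in partitions:
            for y in cell(p, x) - seen:   # one more step, through partition p
                seen.add(y)
                queue.append(y)
    return seen


def common_knowledge_partition(states, partitions):
    """Collection of common knowledge information sets (Definition 7.5)."""
    return {frozenset(reachable(w, partitions)) for w in states}


states = {213, 231, 435, 453, 657, 675}
bruno = [{213}, {231, 435}, {453, 657}, {675}]
caroline = [{213, 453}, {231}, {435, 675}, {657}]

ck = common_knowledge_partition(states, [bruno, caroline])
print(sorted(sorted(block) for block in ck))
```

Using `frozenset` lets identical reachable sets computed from different starting states collapse into a single block of the resulting partition.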


Example 7.1 continued. Let us go back to Example 7.1, where there are two
individuals, Bruno and Caroline, with the following information partitions:

Bruno:    {213}  {231, 435}  {453, 657}  {675}

Caroline: {213, 453}  {231}  {435, 675}  {657}

Applying Definition 7.5 we have that $I_{CK}(213) = I_{CK}(453) = I_{CK}(657) =
\{213, 453, 657\}$⁵ and $I_{CK}(231) = I_{CK}(435) = I_{CK}(675) = \{231, 435, 675\}$. Thus the
common knowledge partition is as shown in Figure 7.9.

{213, 453, 657}  {231, 435, 675}

Figure 7.9


⁵ In fact, 213 is reachable from itself in one step (through either Bruno or Caroline), 453 is
reachable in one step from 213 (through Caroline) and 657 is reachable in two steps from 213
(the first step - to 453 - through Caroline and the second step - from 453 - through Bruno).
As a second example, consider the information partitions of Ann, Bob and Carol
shown in Figure 7.2.

The corresponding common knowledge partition is shown in Figure 7.10.

{a, b, c, d}  {e, f, g, h}

Figure 7.10

The following theorem states that determining whether an event E is common
knowledge at a state w is equivalent to determining whether an individual whose
information partition coincided with the common knowledge partition would
know E at w.

Theorem 7.1. At state $w \in W$ event $E \subseteq W$ is common knowledge (that is,
$w \in CKE$) if and only if $I_{CK}(w) \subseteq E$. Thus we can define the common
knowledge operator $CK : 2^W \to 2^W$ as follows: $CKE = \{w \in W : I_{CK}(w) \subseteq E\}$.

Example 7.1 continued. Let us go back to Example 7.1 about Bruno and Caroline.
Let F be the event that Bruno gets a 1 or a 5: $F = \{213, 453, 657\}$. Then, using
Theorem 7.1, $CKF = F$ because $I_{CK}(213) = I_{CK}(453) = I_{CK}(657) =
\{213, 453, 657\}$; thus - confirming what we found in Example 7.1 - at any state
where Bruno gets a 1 or a 5 it is common knowledge between Bruno and Caroline
that Bruno got a 1 or a 5.
Now let H be the event “Bruno did not get a 5”, that is, $H = \{213, 231, 435, 675\}$.
Then, using Theorem 7.1 we have that $CKH = \{231, 435, 675\}$. Thus while at state
231 Bruno does not get a 5 and this is common knowledge between Bruno and
Caroline, at state 213 Bruno does not get a 5 but this is not common knowledge
between Bruno and Caroline (in fact $213 \notin K_{Caroline}H = \{231, 435, 675\}$).
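Theorem 7.1 can be compared with the original infinite-intersection definition on this example. The Python sketch below is an illustration under stated assumptions: `ck_by_iteration` is a hypothetical helper that repeatedly applies "everybody knows" until the event stops shrinking. On a finite set of states this reaches the same event as the infinite intersection, because each $K_i$ distributes over intersections and never enlarges an event.

```python
def cell(partition, w):
    """Return the information set of `partition` that contains state w."""
    return next(c for c in partition if w in c)


def K(partition, event, states):
    """Knowledge operator for an individual with the given partition."""
    return {w for w in states if cell(partition, w) <= event}


def ck_by_iteration(event, partitions, states):
    """Apply 'everybody knows' again and again until a fixed point is reached."""
    current = set(event)
    while True:
        nxt = set.intersection(*(K(p, current, states) for p in partitions))
        if nxt == current:
            return current
        current = nxt


states = {213, 231, 435, 453, 657, 675}
bruno = [{213}, {231, 435}, {453, 657}, {675}]
caroline = [{213, 453}, {231}, {435, 675}, {657}]
ck_partition = [{213, 453, 657}, {231, 435, 675}]  # Figure 7.9

F = {213, 453, 657}       # "Bruno gets a 1 or a 5"
H = {213, 231, 435, 675}  # "Bruno did not get a 5"

# Theorem 7.1: CK behaves like the knowledge operator of the CK partition.
ck_F = K(ck_partition, F, states)
ck_H = K(ck_partition, H, states)

print("CKF =", sorted(ck_F))
print("CKH =", sorted(ck_H))
print("iteration agrees:",
      ck_by_iteration(F, [bruno, caroline], states) == ck_F,
      ck_by_iteration(H, [bruno, caroline], states) == ck_H)
```

The partition-based computation needs a single pass over the states, whereas the iterative version may need several rounds, which is the practical content of the theorem.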

As a last example, consider again the information partitions of Figure 7.2, whose
common knowledge partition was shown in Figure 7.10 and is reproduced below:

{a, b, c, d}  {e, f, g, h}

Let $E = \{a, b, c, d, e, f\}$. Then $CKE = \{a, b, c, d\}$. Let $F = \{a, b, f, g, h\}$. Then $CKF =
\emptyset$. The smallest event that is common knowledge at state a is
$I_{CK}(a) = \{a, b, c, d\}$ and the smallest event that is common knowledge at state g
is $I_{CK}(g) = \{e, f, g, h\}$.
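An alternative way to obtain the common knowledge partition is to merge information sets that overlap, instead of searching from each state. The Python sketch below illustrates this for the three-person example. It is hedged in two ways: the function names are hypothetical, and the partitions of Ann, Bob and Carol are an assumed reconstruction consistent with the values reported in the text (how Carol groups e and h is a guess that does not change the result).

```python
def common_knowledge_partition(partitions):
    """Merge overlapping information sets until all blocks are disjoint."""
    blocks = []
    for partition in partitions:
        for c in partition:
            merged = set(c)
            # Existing blocks are pairwise disjoint, so one pass finds every overlap.
            for b in [b for b in blocks if b & merged]:
                blocks.remove(b)
                merged |= b
            blocks.append(merged)
    return blocks


def CK(event, blocks):
    """Theorem 7.1: union of the common knowledge blocks contained in `event`."""
    return set().union(*(b for b in blocks if b <= event))


# Assumed partitions (reconstruction; see the lead-in).
ann = [{"a"}, {"b", "c"}, {"d"}, {"e", "f", "g"}, {"h"}]
bob = [{"a", "b"}, {"c", "d"}, {"e"}, {"f"}, {"g", "h"}]
carol = [{"a", "d"}, {"b", "c"}, {"e", "h"}, {"f", "g"}]

blocks = common_knowledge_partition([ann, bob, carol])

E = set("abcdef")
F = set("abfgh")
print("blocks:", sorted(sorted(b) for b in blocks))
print("CKE =", sorted(CK(E, blocks)))
print("CKF =", sorted(CK(F, blocks)))
```

Merging works because two states lie in the same common knowledge information set exactly when a chain of overlapping information sets connects them.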


This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this  section,  by  going  through  the  exercises  in  Section  7.E.3  of  Appendix  7.E  at  the 
end  of  this  chapter. 
Appendix 7.E: Exercises

7.E.1. Exercises for Section 7.1: Individual knowledge

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this  chapter. 

Exercise 7.1. You are in a completely dark room, where you cannot see anything.
You open a drawer that you know to contain individual socks, all of the same size
but of different colors. You know that 5 are blue and 7 are white.

(a)  First  use  your  intuition  to  answer  the  following  question.  What  is  the  smallest 
number  of  socks  you  need  to  take  out  in  order  to  know  (that  is,  to  be 
absolutely  certain)  that  you  have  a matching  pair  (i.e.  either  a pair  of  blue  socks 
or  a pair  of  white  socks)? 

(b)  Now  represent  this  situation  using  states  and  information  sets.  Do  this  to 
(1)  represent  the  situation  after  you  have  taken  out  one  sock,  (2)  represent  the 
situation  after  you  have  taken  out  two  socks  and  (3)  represent  the  situation  after 
you  have  taken  out  three  socks.  Remember  that  a state  should  encode  a 
complete  description  of  the  relevant  aspects  of  the  situation;  in  particular,  the 
state  should  tell  us  how  many  socks  you  have  taken  out  and  the  color  of  each 
sock  that  you  have  taken  out  (thus  the  set  of  states  changes  over  time  as  you 
take  out  more  socks). 

Exercise 7.2. Consider the situation described in Exercise 7.1: the room is dark and
you have taken out three socks. Consider the following alternative scenarios.

(a)  Somebody  tells  you  the  color  of  the  third  sock  (but  you  still  don’t  know 
the  color  of  the  other  two  socks).  Represent  your  state  of  knowledge  by 
means  of  an  information  set. 

(b)  Somebody  tells  you  the  color  of  the  matching  pair  (but  you  don’t  know 
what  socks  you  picked).  Represent  your  state  of  knowledge  by  means  of  an 
information  set. 

Exercise 7.3. Let the set of states be W = {a, b, c, d, e, f, g, h, k, m, n} and the
information partition of an individual be {{a,b,c}, {d}, {e,f,g,h}, {k,m}, {n}}.
Consider the following event: E = {a,b,d,k,n}.

(a) Does the individual know E at state a?

(b) Does the individual know E at state c?

(c) Does the individual know E at state d?

(d) Does the individual know E at state h?

(e) Does the individual know E at state k?

(f) Let KE denote the event that the individual knows E (that is, the set of
states where the individual knows E). What is KE?

(g) [Recall that, given an event, say F, we denote by $\neg F$ the complement of F,
that is, the set of states that are not in F.] Once again, let E = {a,b,d,k,n}.
What is the event $\neg KE$, that is, the event that the individual does not know
E? What is the event $K\neg KE$, that is, the event that the individual knows
that she does not know E?

Exercise  7.4.  The  famous  pirate  Sandokan  has  captured  you  and  put  you  in 
front  of  three  chests  containing  coins.  Chest  number  1 is  labeled  “gold,”  chest 
number  2 is  labeled  “bronze,”  and  chest  number  3 is  labeled  “gold  or  silver.” 
One  chest  contains  gold  coins  only,  another  contains  silver  coins  only,  and  the 
third  bronze  coins  only. 

(a)  Represent  the  set  of  possible  states  in  the  case  where  the  labels  might  or 
might  not  be  correct  (a  state  must  describe  the  label  and  content  of  each 
box). 

(b) Let E be the event “Box number 1 is mislabeled” (that is, what the label
says is false). What states are in event E?

(c) Let F be the event “Box number 2 is mislabeled”. What states are in event F?

(d) What is the event “both boxes 1 and 2 are mislabeled”?

(e) Suppose now that Sandokan tells you that all the chests are falsely labeled,
that is, what the label says is false (for example, if it says “gold” then you
can be sure that the chest does not contain gold coins). If you correctly
announce the content of all the chests you will be given a total of $1,000. If
you make a mistake (e.g. state that a chest contains, say, gold coins while in
fact it contains bronze coins) then you don’t get any money at all. You can
open any number of chests you like in order to inspect the content.
However, the first time you open a chest, you have to pay $500, the second
time $300, the third time $100. (e.1) What is the set of possible states
(assuming that Sandokan told you the truth)? (e.2) What is the maximum
amount of money you can be absolutely certain to make?

Exercise  7.5.  Prove  each  of  the  following  properties  of  the  knowledge 
operator.  The  proofs  are  straightforward  applications  of  Definition  7.2. 

Truth: $KE \subseteq E$, that is, if at a state w one knows E, then E
is indeed true at w.

Consistency: $KE \cap K\neg E = \emptyset$, that is, one never simultaneously
knows E and also $\neg E$ ($\neg E$ denotes the complement
of E).

Positive introspection: $KE \subseteq KKE$, that is, if one knows E then one knows
that one knows E (one is fully aware of what one
knows). [Note that it follows from this and Truth
that $KE = KKE$, because from Truth we get that
$KKE \subseteq KE$.]

Negative introspection: $\neg KE \subseteq K\neg KE$, that is, if one does not know E, then
one knows that one does not know E (one is fully
aware of what one does not know). [Note that it
follows from this and Truth that $\neg KE = K\neg KE$,
because from Truth we get that $K\neg KE \subseteq \neg KE$.]

Monotonicity: If $E \subseteq F$, then $KE \subseteq KF$, that is, if E implies F then if
one knows E then one knows F.

Conjunction: $KE \cap KF = K(E \cap F)$, that is, if one knows E and
one knows F, then one knows E and F, and vice
versa.

7.E.2.  Exercises  for  Section  7.2:  Interactive  knowledge 

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this  chapter. 

Exercise  7.6.  Let  the  set  of  states  be  W = {a,  b,  c,  d,  e,  f,  g,  h}.  There  are  three 
individuals  with  the  following  information  partitions: 


[Figure: the information partitions of Ann, Bob and Carol over the states a, b, c, d, e, f, g, h.]

Consider the event E = {a,b,c,f,g}. Find the following events.

(a) $K_{Ann}E$ (the event that Ann knows E), (b) $K_{Bob}E$, (c) $K_{Carol}E$,

(d) $K_{Carol}K_{Ann}E$ (the event that Carol knows that Ann knows E),

(e) $K_{Bob}K_{Carol}K_{Ann}E$, (f) $K_{Ann}\neg K_{Bob}K_{Carol}E$ (the event that Ann knows that Bob does not know that Carol knows E).

Exercise 7.7. Dan is at the airport. He calls his office and leaves a voice
message  that  says:  “My  flight  was  cancelled  and  I am  waiting  to  see  if  they  can 
re-route  me  through  Los  Angeles  or  San  Francisco.  I will  call  one  of  you  at 
home  tonight  at  8:00  sharp  to  let  that  person  know  whether  I am  in  San 
Francisco  or  Los  Angeles.”  Dan’s  office  staff,  consisting  of  Ann,  Barb  and 
Carol,  were  out  for  lunch.  When  they  come  back  they  listen  to  the  message 
together.  They  leave  the  office  at  5:00  pm  and  each  goes  to  her  home. 

(a)  Using  information  partitions,  represent  the  possible  states  of  knowledge  of 
Ann,  Barb  and  Carol  concerning  Dan’s  whereabouts  at  8:15  pm,  after  Dan’s 
call  (there  has  been  no  communication  among  Ann,  Barb  and  Carol  after 
they  left  the  office). 

(b) Let E be the event that Dan calls either Barb or Carol. What states are in E?

(c) For the event E of part (b), find $K_A E$ (Ann knows E), $K_B K_A E$ (Barb knows that Ann knows E) and $\neg K_C E$ (Carol does not know E, that is, it is not the case that Carol knows E).

(d) For the event E of part (b), find a state x where all of the following are true: (1) at x Ann knows E, (2) at x Barb knows that Ann knows E, (3) at x it is not the case that Carol knows E.

Exercise  7.8.  A set  of  lights  is  controlled  by  two  switches,  each  of  which  can  be  in 
either  the  Up  position  or  in  the  Down  position.  One  switch  is  in  room  number  1, 
where  Ann  is;  the  other  switch  is  in  room  number  2,  where  Bob  is.  The  lights  are 
in  room  number  3,  where  Carla  is.  There  are  two  lights:  one  red  and  one  green. 
The  red  light  is  on  if  the  two  switches  are  in  different  positions  (one  up  and  the 
other  down:  it  doesn’t  matter  which  is  up  and  which  is  down),  while  the  green  light 
is  on  if  the  two  switches  are  in  the  same  position  (both  up  or  both  down).  All  this  is 
common  knowledge  among  Ann,  Bob  and  Carla. 

(a)  Represent  the  possible  states  (you  need  to  specify  the  position  of  each 
switch  and  which  light  is  on). 

(b)  Represent  the  possible  states  of  information  of  Ann,  Bob  and  Carla  by 
means  of  information  partitions. 

(c) Let G be the event “the green light is on”. Find the events G, $K_A G$ (Ann knows G), $K_B G$ (Bob knows G), $K_C G$ (Carla knows G).

(d) Let L be the event “either the green light is on or the red light is on”. Find the events L, $K_A L$ (Ann knows L), $K_B L$ (Bob knows L), $K_C L$ (Carla knows L).

7.E.3. Exercises for Section 7.3: Common knowledge

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this  chapter. 

Exercise  7.9.  In  Exercise  7.6, 

(a)  find  the  common  knowledge  partition, 

(b) find the event CKE (that is, the event that E is common knowledge; recall that E = {a,b,c,f,g}),

(c) letting F = {a,b,c,d,e,g}, find CKF, that is, the event that F is common knowledge.

Exercise  7.10.  In  Exercise  7.7, 

(a)  find  the  common  knowledge  partition, 

(b)  find  the  event  CKE  (where  E is  the  event  that  Dan  calls  either  Barb  or  Carol). 

Exercise  7.11.  In  Exercise  7.8, 

(a)  find  the  common  knowledge  partition, 

(b)  find  the  event  CKG  (where  G is  the  event  “the  green  light  is  on”), 

(c) find the event CKL (where L is the event “either the green light is on or the red light is on”).

Exercise  7.12.  The  set  of  states  is  W = {a,  b,  c,  d,  e,  f,  g,  h}.  There  are  four 
individuals  with  the  following  information  partitions: 

Individual 1: { {a, b}, {c}, {d}, {e, f}, {g}, {h} }

Individual 2: { {a}, {b, c}, {d, e}, {f}, {g}, {h} }

Individual 3: { {a, c}, {b}, {d}, {e}, {g}, {f, h} }

Individual 4: { {a}, {b, c}, {d, e}, {f, g}, {h} }

(a) Let E = {a, c, d, e}. Find the following events: $K_1E$, $K_2E$, $K_3E$, $K_4E$ and $K_1K_2\neg K_3E$. (Recall that $\neg$ denotes the complement of a set, that is, $\neg F$ is the set of all states that are not in F.)

(b)  Find  the  common  knowledge  partition. 

(c)  At  what  states  is  event  E = {a,  c,  d,  e } common  knowledge? 

(d) Let F = {a, b, c, d, g, h}. Find the event that F is common knowledge, that is, find CKF.

Exercise 7.13. Amy, Beth and Carla are now in Room 1. They are asked to
proceed,  one  at  a time,  to  Room  3 through  Room  2.  In  Room  2 there  are  two  large 
containers,  one  with  many  red  hats  and  the  other  with  many  white  hats.  They  have 
to  choose  one  hat,  put  it  on  their  head  and  then  go  and  sit  in  the  chair  in  Room  3 
that  has  their  name  on  it. 


[Figure: Room 1, Room 2 and Room 3 side by side. Carla, Beth and Amy wait in Room 1; Room 2 holds a container of white hats and a container of red hats; the chairs are in Room 3.]

Amy goes first, then Beth then Carla. The chairs are turned with the back to the door. Thus a person entering Room 3 can see whoever is already seated there, but cannot be seen by them. Beth and Carla don’t know how many hats there were in each box.

(a)  Use  information  partitions  to  represent  the  possible  states  of  knowledge  of  Amy, 
Beth  and  Carla  after  they  are  seated  in  Room  3. 

(b)  Suppose  that  Amy  chose  a white  hat,  Beth  a red  hat  and  Carla  a white  hat.  Find 
the  smallest  event  that  is  common  knowledge  among  them.  Give  also  a verbal 
description  of  this  event. 

(c) Repeat Parts (a) and (b) for the modified setting where there is a mirror in Room 3 that allows Amy to see the hat of Carla (but not that of Beth).

Exercise 7.14. Challenging Question.

Pete  and  his  three  daughters  Elise,  Andrea  and  Theresa  are  in  the  same  room. 
Pete  gives  a sealed  envelope  to  Elise  and  tells  her  (in  a loud  voice,  so  that 
everybody  can  hear)  “in  this  envelope  I put  a sum  of  money;  I don’t  remember 
how  much  I put  in  it,  but  I know  for  sure  that  it  was  either  $4  or  $8  or  $12. 
Now,  my  dear  Elise,  go  to  your  room  by  yourself  and  open  it.  Divide  the 
money  into  two  equal  amounts,  put  the  two  sums  in  two  different  envelopes, 
seal  them  and  give  one  to  Andrea  and  one  to  Theresa”.  Unbeknownst  to  her 
sisters,  Elise  likes  one  of  them  more  than  the  other  and  decides  to  disobey  her 
father:  after  dividing  the  sum  into  two  equal  parts,  she  takes  $1  from  one 
envelope  and  puts  it  in  the  other  envelope.  She  then  gives  the  envelope  with 
more  money  to  her  favorite  sister  and  the  envelope  with  the  smaller  amount  to 
the  other  sister.  Andrea  and  Theresa  go  to  their  respective  rooms  and  privately 
open  their  envelopes,  to  discover,  to  their  surprise,  an  odd  number  of  dollars. 
So  they  realize  that  Elise  did  not  follow  their  father’s  instructions.  Neither 
Andrea  nor  Theresa  suspect  that  Elise  kept  some  money  for  herself;  in  fact,  it  is 
common  knowledge  between  them  that  Elise  simply  rearranged  the  money, 
without  taking  any  for  herself.  Of  course,  neither  Andrea  nor  Theresa  know  in 
principle  how  much  money  Elise  took  from  one  envelope  (although  in  some  cases 
they  might  be  able  to  figure  it  out).  Thus  it  is  not  common  knowledge  between 
Andrea  and  Theresa  that  Elise  took  only  $1  from  one  of  the  two  envelopes. 
Your  answers  should  reflect  this. 

(a) Use states and information partitions to represent the possible states of knowledge of Andrea and Theresa.

(b) Let E be the event that Andrea is Elise’s favorite sister. Find the events $K_A E$ (the event that Andrea knows it), $K_T E$ (the event that Theresa knows it), $K_A K_T E$ and $K_T K_A E$.

(c)  Find  the  common  knowledge  partition. 

(d)  Is  there  a state  at  which  it  is  common  knowledge  between  Andrea  and 
Theresa  that  Elise’s  favorite  sister  is  Andrea? 

(e)  The  night  before,  Pete  was  looking  through  his  digital  crystal  ball  and 
saw  what  Elise  was  planning  to  do.  However  the  software  was  not 
working  properly  (the  screen  kept  freezing)  and  he  could  not  tell 
whether  Elise  was  trying  to  favor  Andrea  or  Theresa.  He  knew  he 
couldn’t  stop  Elise  and  wondered  how  much  money  he  should  put  in 
the  envelope  to  make  sure  that  the  mistreated  sister  (whether  it  be 
Andrea  or  Theresa)  would  not  know  that  she  had  been  mistreated.  How 
much money should he put in the envelope?

Appendix 7.S: Solutions to exercises

Exercise 7.1. (a) The answer is that you need to take out three socks.

(b) Describe a state by the number of socks you have taken out and the color of each sock that you have taken out (B for blue, W for white).

After you have taken out only one sock, the information set is {(1; B), (1; W)} (the information set captures the fact that you cannot see the color of the sock because it is dark).

After you have taken out two socks, the information set is {(2; B,B), (2; B,W), (2; W,B), (2; W,W)}.

After you have taken out three socks, the information set is {(3; B,B,B), (3; B,B,W), (3; B,W,B), (3; W,B,B), (3; B,W,W), (3; W,B,W), (3; W,W,B), (3; W,W,W)}.

Now at every state there are (at least) two socks of the same color, thus you know that you have a matching pair, even though you don’t know what state you are in and hence you don’t know the color of the matching pair.

An alternative (and equivalent) way to proceed would be to describe the state as a quadruple of numbers $(x_1, x_2, x_3, x_4)$ as follows:

$x_1$: number of blue socks in the drawer
$x_2$: number of white socks in the drawer
$x_3$: number of blue socks in your hand
$x_4$: number of white socks in your hand

Clearly, it must always be that $x_1 + x_3 = 5$ and $x_2 + x_4 = 7$. The initial state is $x_1 = 5$, $x_2 = 7$, $x_3 = x_4 = 0$, that is, (5, 7, 0, 0). Taking one sock from the drawer leads to one of two states: (4, 7, 1, 0) or (5, 6, 0, 1). Since it is dark and you don’t know what color sock you took, you cannot distinguish between the two states; together they form an information set.

Taking a second sock leads to one of three states: (3, 7, 2, 0), (4, 6, 1, 1) or (5, 5, 0, 2). Since you could not distinguish between the two initial states, you cannot now distinguish among the three states that represent all the possibilities. In two of the states you have a matching pair, but you cannot be certain that you have a matching pair because you might be in the middle state where you have one blue sock and one white sock.

Taking a third sock leads to one of four states: (2, 7, 3, 0), (3, 6, 2, 1), (4, 5, 1, 2) or (5, 4, 0, 3).

Now you know that you have a matching pair because in every possible state you have at least 2 socks of the same color. Thus the answer is indeed that you need to take out three socks.
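
The counting argument above can be checked mechanically. The following Python sketch is an illustration only: the function names are hypothetical, and it assumes the drawer contents used in the quadruple description (5 blue and 7 white socks). It collapses each state to the pair (blue socks in hand, white socks in hand), treats all states reachable after n draws as a single information set, and looks for the smallest n at which every state in that set contains a matching pair.

```python
BLUE, WHITE = 5, 7  # drawer contents, taken from x1 + x3 = 5 and x2 + x4 = 7


def possible_hands(n):
    """All (blue, white) hands you might hold after drawing n socks in the dark."""
    return {(b, n - b) for b in range(n + 1) if b <= BLUE and n - b <= WHITE}


def knows_matching_pair(n):
    """You know you hold a pair only if every state of the information set has one."""
    return all(b >= 2 or w >= 2 for b, w in possible_hands(n))


def socks_needed():
    """Smallest number of draws after which a matching pair is known."""
    n = 1
    while not knows_matching_pair(n):
        n += 1
    return n


print(socks_needed())
```

After two draws the hand (1, 1) is still possible, which is why knowledge of a matching pair only arrives with the third sock.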
Exercise 7.2.

[Figure: solution diagram; its two outcomes are labeled “you have a blue matching pair” and “you have a white matching pair”.]


Exercise  7.3.  (a)  No,  because  the  information  set  containing  a is  {a,b,c}  which 
is  not  contained  in  E = {a,b,d,k,n}. 

(b)  No,  because  the  information  set  containing  c is  {a,b,c}  which  is  not 
contained  in  E = {a,b,d,k,n}. 

(c) Yes, because the information set containing d is {d} which is contained in E = {a,b,d,k,n}.

(d) No, because the information set containing h is {e,f,g,h} which is not contained in E = {a,b,d,k,n}.

(e) No, because the information set containing k is {k,m} which is not contained in E = {a,b,d,k,n}.

(f) $KE = \{d,n\}$.

(g) $\neg KE = \{a,b,c,e,f,g,h,k,m\}$. $K\neg KE = \{a,b,c,e,f,g,h,k,m\} = \neg KE$.

Exercise  7.4. 

(a) Describe a state by a triple $((x_1,y_1),(x_2,y_2),(x_3,y_3))$ where $x_i$ is the label on box number i (i = 1,2,3) and $y_i$ is the content of box number i. Then the set of possible states is

$z_1$ = ((G,B),(B,G),(G or S,S)), $z_2$ = ((G,B),(B,S),(G or S,G)),
$z_3$ = ((G,G),(B,B),(G or S,S)), $z_4$ = ((G,G),(B,S),(G or S,B)),
$z_5$ = ((G,S),(B,B),(G or S,G)), $z_6$ = ((G,S),(B,G),(G or S,B)).

Thus, for example, state $z_4$ is one where box number 1 (which is labeled ‘gold’) in fact contains gold coins, box number 2 (which is labeled ‘bronze’) as a matter of fact contains silver coins, and box number 3 (which is labeled ‘gold or silver’) as a matter of fact contains bronze coins. More simply, we could write a state as a triple $(y_1,y_2,y_3)$ denoting the content of each box (since we are told what the labels are). In this simpler notation the states are:

$z_1$ = (B,G,S), $z_2$ = (B,S,G), $z_3$ = (G,B,S),
$z_4$ = (G,S,B), $z_5$ = (S,B,G), $z_6$ = (S,G,B).

(b)  E = {(B,G,S),  (B,S,G),  (S,B,G),  (S,G,B)}  (recall  that  the  label  says  “gold”). 

(c)  F = {(B,G,S),  (B,S,G),  (G,S,B),  (S,G,B)}  (recall  that  the  label  says 
“bronze”). 

(d) $E \cap F$ = {(B,G,S), (B,S,G), (S,G,B)}.

(e)  Of  all  the  states  listed  above,  only  state  z6  is  such  that  all  the  labels  are 
false.  Hence  Sandokan’s  statement  reduces  the  set  of  states  to  only  one: 
z6  = (S,G,B).  This  is  because  the  label  “gold  or  silver”  must  be  on  the 
chest  containing  the  bronze  coins.  Hence  we  are  only  left  with  gold  and 
silver.  Then  silver  must  be  in  the  chest  labeled  “gold”.  Hence  gold  must 
be  in  the  chest  labeled  “bronze”.  Thus,  by  looking  at  the  labels  you  can 
correctly  guess  the  content  without  opening  any  chests:  you  know  that 
the  true  state  is  (S,G,B).  So  your  expected  payoff  is  $1,000  (you  are  not 
guessing  at  random,  you  are  deducing  by  reasoning). 
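
The reasoning in parts (b) to (e) can be reproduced by brute-force enumeration. The Python sketch below is illustrative only (the names `LABELS`, `states` and `mislabeled` are hypothetical): it uses the simpler notation in which a state is the triple of box contents, encodes each label as the set of contents that would make it true, and filters the six states.

```python
from itertools import permutations

# What each label claims: box 1 "gold", box 2 "bronze", box 3 "gold or silver".
LABELS = [{"G"}, {"B"}, {"G", "S"}]


def states():
    """All assignments of gold, silver and bronze to the three boxes."""
    return list(permutations("GSB"))


def mislabeled(state, box):
    """True if the content of the box (0-indexed) contradicts its label."""
    return state[box] not in LABELS[box]


E = {s for s in states() if mislabeled(s, 0)}   # box 1 is mislabeled
F = {s for s in states() if mislabeled(s, 1)}   # box 2 is mislabeled
all_false = [s for s in states() if all(mislabeled(s, i) for i in range(3))]

print(sorted(E & F))
print(all_false)
```

Only one state survives the requirement that every label be false, so no chest needs to be opened.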

Exercise  7.5. 

Truth: $KE \subseteq E$. Consider an arbitrary $w \in KE$. We have to show that $w \in E$. Since $w \in KE$, by Definition 7.2 $I(w) \subseteq E$. Since $w \in I(w)$, it follows that $w \in E$.

Consistency: $KE \cap K\neg E = \emptyset$. Suppose that $w \in KE \cap K\neg E$ for some w and some E. Then, by Definition 7.2, $I(w) \subseteq E$ (because $w \in KE$) and $I(w) \subseteq \neg E$ (because $w \in K\neg E$) and thus $I(w) \subseteq E \cap \neg E$. Since $E \cap \neg E = \emptyset$, this implies that $I(w) = \emptyset$, which is not true because $w \in I(w)$.

Positive introspection: $KE \subseteq KKE$. Consider an arbitrary $w \in KE$. We need to show that $w \in KKE$, that is, that $I(w) \subseteq KE$, which means that $w' \in KE$ for every $w' \in I(w)$. Since $w \in KE$, by Definition 7.2, $I(w) \subseteq E$. Consider an arbitrary $w' \in I(w)$. By definition of partition, $I(w') = I(w)$. Thus $I(w') \subseteq E$, that is, by Definition 7.2, $w' \in KE$.

Negative introspection: $\neg KE \subseteq K\neg KE$. Consider an arbitrary $w \in \neg KE$. We need to show that $w \in K\neg KE$, that is, that $I(w) \subseteq \neg KE$. By Definition 7.2, since $w \in \neg KE$, $I(w) \cap \neg E \neq \emptyset$. Consider an arbitrary $w' \in I(w)$; then, since (by definition of partition) $I(w') = I(w)$, $I(w') \cap \neg E \neq \emptyset$ so that $w' \in \neg KE$. Thus we have shown that, for every $w' \in I(w)$, $w' \in \neg KE$, that is, $I(w) \subseteq \neg KE$ which, by Definition 7.2, yields $w \in K\neg KE$.

Monotonicity: if $E \subseteq F$, then $KE \subseteq KF$. Consider an arbitrary $w \in KE$. We need to show that $w \in KF$. Since $w \in KE$, by Definition 7.2, $I(w) \subseteq E$. Hence, since by hypothesis $E \subseteq F$, $I(w) \subseteq F$, that is, by Definition 7.2, $w \in KF$.

Conjunction: $KE \cap KF = K(E \cap F)$. Let $w \in KE \cap KF$. Then $w \in KE$ and $w \in KF$; by Definition 7.2, the former implies that $I(w) \subseteq E$ and the latter that $I(w) \subseteq F$, so that $I(w) \subseteq E \cap F$ and hence, by Definition 7.2, $w \in K(E \cap F)$. Conversely, suppose that $w \in K(E \cap F)$. Then, by Definition 7.2, $I(w) \subseteq E \cap F$ and thus $I(w) \subseteq E$ and $I(w) \subseteq F$ so that, by Definition 7.2, $w \in KE$ and $w \in KF$; hence $w \in KE \cap KF$.
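
The six properties can also be sanity-checked by exhaustive search on a small example. The Python sketch below is not a substitute for the proofs; it assumes a hypothetical five-state partition (`EXAMPLE`) and implements the knowledge operator as the union of the information sets contained in the event, which is how Definition 7.2 is used in the proofs above.

```python
from itertools import combinations


def K(partition, event):
    """KE: the states w whose information set I(w) is a subset of E."""
    return frozenset().union(*[cell for cell in partition if cell <= event])


def events(W):
    """Every subset of W, as frozensets."""
    items = sorted(W)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


def check_properties(partition):
    """Check the six properties for every event (and pair of events) on W."""
    W = frozenset().union(*partition)
    for E in events(W):
        KE = K(partition, E)
        assert KE <= E                                  # Truth
        assert not (KE & K(partition, W - E))           # Consistency
        assert K(partition, KE) == KE                   # Positive introspection
        assert K(partition, W - KE) == W - KE           # Negative introspection
        for F in events(W):
            if E <= F:
                assert KE <= K(partition, F)            # Monotonicity
            assert (KE & K(partition, F)) == K(partition, E & F)  # Conjunction
    return True


# Hypothetical partition of W = {a, b, c, d, e}, used only for illustration.
EXAMPLE = [frozenset("ab"), frozenset("c"), frozenset("de")]
print(check_properties(EXAMPLE))
```

A passing check on one partition proves nothing in general, but a failing one would point to a mistake in how the operator was coded.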

Exercise 7.6. (a) $K_{Ann}E = \{a,b,c\}$, (b) $K_{Bob}E = \{a,b,f\}$, (c) $K_{Carol}E = \{b,c,f,g\}$,

(d) $K_{Carol}K_{Ann}E = K_{Carol}\{a,b,c\} = \{b,c\}$, (e) $K_{Bob}K_{Carol}K_{Ann}E = K_{Bob}\{b,c\} = \emptyset$,

(f) $K_{Ann}\neg K_{Bob}K_{Carol}E = K_{Ann}\neg K_{Bob}\{b,c,f,g\} = K_{Ann}\neg\{f\} = K_{Ann}\{a,b,c,d,e,g,h\} = \{a,b,c,d,h\}$.


Exercise  7.7.  (a)  We  can  represent  a state  as  a pair  (x,y)  where  x is  the  city 
where  Dan  is  (SF  for  San  Francisco  or  LA  for  Los  Angeles)  and  y is  the  person 
that  Dan  called  ( A for  Ann,  B for  Barb  and  C for  Carol).  The  information 
partitions  are  as  follows: 


ANN: { {(SF,A)}, {(LA,A)}, {(SF,B), (LA,B), (SF,C), (LA,C)} }

BARB: { {(SF,A), (LA,A), (SF,C), (LA,C)}, {(SF,B)}, {(LA,B)} }

CAROL: { {(SF,A), (LA,A), (SF,B), (LA,B)}, {(SF,C)}, {(LA,C)} }

(b) E = {(SF,B), (LA,B), (SF,C), (LA,C)}.

(c) $K_{Ann}E = E$, $K_{Barb}K_{Ann}E = \{(SF,B), (LA,B)\}$ and $K_{Carol}E = \{(SF,C), (LA,C)\}$, so that $\neg K_{Carol}E = \{(SF,A), (LA,A), (SF,B), (LA,B)\}$.

(d) We want a state x such that $x \in K_{Ann}E$, $x \in K_{Barb}K_{Ann}E$ and $x \in \neg K_{Carol}E$ (that is, $x \notin K_{Carol}E$). There are only two such states: (SF,B) and (LA,B). Thus either x = (SF,B) or x = (LA,B).


Exercise 7.8. (a) We can represent a state as a triple (x, y, z) where x is the position of the switch in room 1 (Up or Down), y is the position of the switch in room 2 and z is the light which is on (Green or Red). The possible states are (U,U,G), (U,D,R), (D,U,R) and (D,D,G).

(b) The information partitions are as follows:

Ann: { {(U,U,G), (U,D,R)}, {(D,U,R), (D,D,G)} }

Bob: { {(U,U,G), (D,U,R)}, {(U,D,R), (D,D,G)} }

Carla: { {(U,U,G), (D,D,G)}, {(U,D,R), (D,U,R)} }

(c) G = {(U,U,G), (D,D,G)}. $K_A G = \emptyset$, $K_B G = \emptyset$, $K_C G$ = {(U,U,G), (D,D,G)}.

(d) L is the set of all states. Hence $K_A L = K_B L = K_C L = L$.
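
The partitions in part (b) follow directly from what each person can observe, which suggests a compact way to build them. The Python sketch below is an illustration under that reading of the exercise (Ann observes only her own switch, Bob only his, Carla only the lights); the helper names `light`, `partition` and `K` are hypothetical.

```python
from itertools import product


def light(x, y):
    """Green if the two switches are in the same position, red otherwise."""
    return "G" if x == y else "R"


STATES = [(x, y, light(x, y)) for x, y in product("UD", repeat=2)]


def partition(observe):
    """Group the states that produce the same observation into one information set."""
    cells = {}
    for s in STATES:
        cells.setdefault(observe(s), set()).add(s)
    return [frozenset(c) for c in cells.values()]


ANN = partition(lambda s: s[0])     # Ann sees only the switch in room 1
BOB = partition(lambda s: s[1])     # Bob sees only the switch in room 2
CARLA = partition(lambda s: s[2])   # Carla sees only which light is on


def K(part, event):
    """States whose information set is contained in the event."""
    return frozenset().union(*[c for c in part if c <= event])


G = frozenset(s for s in STATES if s[2] == "G")
L = frozenset(STATES)
print(K(ANN, G), K(BOB, G), K(CARLA, G) == G, K(ANN, L) == L)
```

Neither Ann nor Bob ever knows G because each of their information sets mixes a green state with a red one, whereas Carla's partition separates them.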
Exercise 7.9. (a) The common knowledge partition is: { {a, b, c, d}, {e, f, g, h} }.

(b) $CKE = \emptyset$ (where E = {a,b,c,f,g}).

(c) $CKF$ = {a,b,c,d} (where F = {a,b,c,d,e,g}).

Exercise 7.10. (a) The common knowledge partition consists of a single cell containing all the states:

{ {(SF,A), (LA,A), (SF,B), (LA,B), (SF,C), (LA,C)} }

(b) $CKE = \emptyset$.

Exercise 7.11. (a) The common knowledge partition consists of a single cell containing all the states:

{ {(U,U,G), (U,D,R), (D,U,R), (D,D,G)} }

(b) $CKG = \emptyset$.

(c) $CKL = L$.

Exercise 7.12. (a) Let E = {a, c, d, e}. Then: $K_1E$ = {c, d}, $K_2E$ = {a, d, e}, $K_3E$ = {a, c, d, e}, $K_4E$ = {a, d, e} and $K_1K_2\neg K_3E$ = {g, h} [in fact, $\neg K_3E$ = {b, f, g, h}, so that $K_2\neg K_3E$ = {f, g, h} and $K_1K_2\neg K_3E$ = {g, h}].

(b) The common knowledge partition is { {a, b, c}, {d, e, f, g, h} }.

(c) At no state is event E = {a, c, d, e} common knowledge: $CKE = \emptyset$.

(d) $CKF$ = {a, b, c} (where F = {a, b, c, d, g, h}).
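
The answers to Exercise 7.12 can be reproduced with a few lines of code. The Python sketch below is illustrative: the helper names are hypothetical, and it assumes that Individual 1's partition contains the cell {e, f}, which is the reading consistent with the answers stated above. The common knowledge partition is computed by repeatedly merging cells (from any individual) that share a state.

```python
W = frozenset("abcdefgh")


def part(*cells):
    return [frozenset(c) for c in cells]


P1 = part("ab", "c", "d", "ef", "g", "h")   # the cell {e, f} is an assumption
P2 = part("a", "bc", "de", "f", "g", "h")
P3 = part("ac", "b", "d", "e", "g", "fh")
P4 = part("a", "bc", "de", "fg", "h")


def K(partition, event):
    """States whose information set is contained in the event."""
    return frozenset().union(*[c for c in partition if c <= event])


def ck_partition(partitions):
    """Merge overlapping cells of all individuals until the cells are disjoint."""
    merged = []
    for p in partitions:
        for cell in p:
            cell = set(cell)
            for m in [m for m in merged if m & cell]:
                merged.remove(m)
                cell |= m
            merged.append(cell)
    return [frozenset(m) for m in merged]


E = frozenset("acde")
F = frozenset("abcdgh")
CK = ck_partition([P1, P2, P3, P4])

print(sorted(K(P1, K(P2, W - K(P3, E)))))   # K1 K2 not-K3 E
print(sorted("".join(sorted(c)) for c in CK))
print(sorted(K(CK, E)), sorted(K(CK, F)))   # CKE and CKF
```

Treating the common knowledge partition as just another partition means the same operator `K` yields CKE and CKF.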

Exercise 7.13. (a) Represent the state as a triple of letters (x, y, z), where x denotes the color of Amy’s hat, y the color of Beth’s hat and z the color of Carla’s hat (W for white, R for red). Each of them knows which hat she chose; furthermore, Beth can see Amy’s hat and Carla can see everything. Thus the partitions are as follows.

Amy: { {(W,W,W), (W,W,R), (W,R,W), (W,R,R)}, {(R,W,W), (R,W,R), (R,R,W), (R,R,R)} }

Beth: { {(W,W,W), (W,W,R)}, {(W,R,W), (W,R,R)}, {(R,W,W), (R,W,R)}, {(R,R,W), (R,R,R)} }

Carla: { {(W,W,W)}, {(W,W,R)}, {(W,R,W)}, {(W,R,R)}, {(R,W,W)}, {(R,W,R)}, {(R,R,W)}, {(R,R,R)} }

Common knowledge partition: { {(W,W,W), (W,W,R), (W,R,W), (W,R,R)}, {(R,W,W), (R,W,R), (R,R,W), (R,R,R)} }

(b) When the true state is (W, R, W), the smallest event that is common knowledge among them is the first information set of the common knowledge partition, that is, the fact that Amy has a white hat.

(c) In the modified setting Amy also knows the color of Carla’s hat, while Beth’s and Carla’s partitions are unchanged. The information partitions are as follows:

Amy: { {(W,W,W), (W,R,W)}, {(W,W,R), (W,R,R)}, {(R,W,W), (R,R,W)}, {(R,W,R), (R,R,R)} }

Beth: { {(W,W,W), (W,W,R)}, {(W,R,W), (W,R,R)}, {(R,W,W), (R,W,R)}, {(R,R,W), (R,R,R)} }

Carla: { {(W,W,W)}, {(W,W,R)}, {(W,R,W)}, {(W,R,R)}, {(R,W,W)}, {(R,W,R)}, {(R,R,W)}, {(R,R,R)} }

The common knowledge partition is the same as before, hence the smallest event that is common knowledge among all three of them is that Amy has a white hat.

Exercise 7.14 (Challenging Question). (a) Describe the state by a triple (n, a, t), where n is the amount of money that Pete put in the envelope, a is the amount given to Andrea and t is the amount given to Theresa. As a matter of fact, each sister can only find either $1 or $3 or $5 or $7 in the envelope. Thus the objectively possible states are: (4,1,3), (4,3,1), (8,3,5), (8,5,3), (12,5,7) and (12,7,5). However, besides these there are also subjectively possible states, namely (8,1,7), (8,7,1), (12,1,11), (12,11,1), (12,3,9) and (12,9,3) (because Andrea and Theresa don’t know that Elise only transferred $1 from one envelope to the other). The information partitions are as follows:

ANDREA: { {(4,1,3), (8,1,7), (12,1,11)}, {(4,3,1), (8,3,5), (12,3,9)}, {(8,5,3), (12,5,7)}, {(8,7,1), (12,7,5)}, {(12,9,3)}, {(12,11,1)} }

THERESA: { {(4,3,1), (8,7,1), (12,11,1)}, {(4,1,3), (8,5,3), (12,9,3)}, {(8,3,5), (12,7,5)}, {(8,1,7), (12,5,7)}, {(12,3,9)}, {(12,1,11)} }

(b) The event that Andrea is Elise’s favorite sister is

E = {(4,3,1), (8,5,3), (8,7,1), (12,7,5), (12,9,3), (12,11,1)}.

$K_A E$ = {(8,7,1), (12,7,5), (12,9,3), (12,11,1)}, $K_T E$ = {(4,3,1), (8,7,1), (12,11,1)}, $K_A K_T E$ = {(12,11,1)}, $K_T K_A E = \emptyset$.




GAME  THEORY  - Giacomo  Bonanno 


(c) The common knowledge partition is:

{ {(4,1,3), (8,1,7), (8,5,3), (12,1,11), (12,5,7), (12,9,3)}, {(4,3,1), (8,3,5), (8,7,1), (12,3,9), (12,7,5), (12,11,1)} }

(d) At no state. In fact, $CKE = \emptyset$.

(e) Pete should put either $8 or $12 in the envelope. If he were to put $4, then one sister would end up with $1 and know that she was mistreated. In no other case does a mistreated sister know that she got less money than the other.
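
The whole solution to Exercise 7.14 can be recomputed from the description of the states. The Python sketch below is an illustration with hypothetical helper names. It assumes, as in part (a), that a state is a triple (total, Andrea's amount, Theresa's amount) with two odd amounts adding up to $4, $8 or $12, and that each sister observes only her own amount.

```python
TOTALS = (4, 8, 12)
# All subjectively possible states: two odd amounts that add up to the total.
STATES = [(n, a, n - a) for n in TOTALS for a in range(1, n, 2)]
W = frozenset(STATES)


def partition(observe):
    """Group the states that a sister cannot tell apart."""
    cells = {}
    for s in STATES:
        cells.setdefault(observe(s), set()).add(s)
    return [frozenset(c) for c in cells.values()]


ANDREA = partition(lambda s: s[1])    # Andrea sees only her own envelope
THERESA = partition(lambda s: s[2])   # Theresa sees only her own envelope


def K(part, event):
    """States whose information set is contained in the event."""
    return frozenset().union(*[c for c in part if c <= event])


E = frozenset(s for s in STATES if s[1] > s[2])   # Andrea is the favorite


def safe_totals():
    """Totals for which the short-changed sister never knows she was short-changed."""
    safe = []
    for n in TOTALS:
        andrea_short = (n, n // 2 - 1, n // 2 + 1)    # Elise actually moves only $1
        theresa_short = (n, n // 2 + 1, n // 2 - 1)
        if andrea_short not in K(ANDREA, W - E) and theresa_short not in K(THERESA, E):
            safe.append(n)
    return safe


print(sorted(K(ANDREA, E)), sorted(K(THERESA, E)))
print(sorted(K(ANDREA, K(THERESA, E))), sorted(K(THERESA, K(ANDREA, E))))
print(safe_totals())
```

The last function restates part (e): a total is safe when neither objectively possible state for that total lies in the event that the short-changed sister knows she received less.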
Chapter 8


Adding  Beliefs  to  Knowledge 


8.1  Probabilistic  beliefs 

An  information  set  contains  all  the  states  that  an  individual  considers 
possible,  that  is,  the  states  that  the  individual  cannot  rule  out,  given 
her  information.  However,  of  all  the  states  that  are  possible,  the 
individual  might  consider  some  to  be  more  likely  than  others  and  might  even 
dismiss  some  states  as  “extremely  unlikely”  or  “implausible”.  For  example, 
suppose  that  there  are  only  three  students  in  a class:  Ann,  Bob  and  Carla.  The 
professor  tells  them  that  in  the  last  exam  one  of  them  got  95  points  (out  of  100), 
another  78  and  the  third  54.  We  can  think  of  a state  as  a triple  (a,b,c),  where  a is 
Ann’s  score,  b Bob’s  score  and  c Carla’s  score.  Then,  based  on  the  information 
given  by  the  professor,  Ann  must  consider  all  of  the  following  states  as 
possible:  (95,78,54),  (95,54,78),  (78,95,54),  (78,54,95),  (54,95,78)  and  (54,78,95). 
Suppose,  however,  that  in  almost  all  the  past  exams  Ann  and  Bob  always 
obtained  a higher  score  than  Carla  and  often  Ann  outperformed  Bob.  Then 
she  might  consider  states  (95,78,54)  and  (78,95,54)  much  more  likely  than 
(78,54,95)  and  (54,78,95).  To  represent  such  judgments  of  relative  likelihood  we 
add  to  an  information  set  a probability  distribution  over  the  states  in  the 
information set.6 The probability distribution expresses the individual’s beliefs, while the information set represents what the individual knows. In this example, Ann’s beliefs could be as shown in Table 8.1.

State        (95,78,54)  (95,54,78)  (78,95,54)  (54,95,78)  (78,54,95)  (54,78,95)
Probability  9/16        4/16        2/16        1/16        0           0

Table 8.1

6 Recall that a probability distribution over a finite set $\{x_1, x_2, \dots, x_m\}$ is an assignment of a number $p_i$ to every element $x_i$ of the set satisfying two properties: (1) for every $i \in \{1, 2, \dots, m\}$, $0 \le p_i \le 1$ and (2) $p_1 + p_2 + \dots + p_m = 1$.


According  to  the  beliefs  expressed  by  the  probability  distribution  shown  in 
Table  8.1,  Ann  considers  it  very  likely  that  she  got  the  highest  score,  is  willing 
to  dismiss  the  possibility  that  Carla  received  the  highest  score  (she  assigns 
probability  0 to  the  two  states  where  Carla’s  score  is  95)  and  considers  it  much 
more  likely  that  she,  rather  than  Bob,  received  the  highest  score. 

Let W be a set of states (which we can think of as an information set of an individual). A probability distribution on W is a function $P: W \to [0,1]$ such that $\sum_{w \in W} P(w) = 1$. As usual, we call the subsets of W events. The probability of event $E \subseteq W$, denoted by $P(E)$, is defined as the sum of the probabilities of the elements of E: $P(E) = \sum_{w \in E} P(w)$. For instance, in the example of Table 8.1, the proposition “Ann received the highest score” corresponds to event E = {(95,78,54), (95,54,78)} and, according to Ann’s beliefs, the probability of that event is $P(E) = P((95,78,54)) + P((95,54,78)) = \frac{9}{16} + \frac{4}{16} = \frac{13}{16}$; on the other hand, event F = {(95,78,54), (78,95,54), (54,95,78)} corresponds to the proposition “Bob’s score was higher than Carla’s score” and, according to Ann’s beliefs, the probability of that event is

$P(F) = P((95,78,54)) + P((78,95,54)) + P((54,95,78)) = \frac{9}{16} + \frac{2}{16} + \frac{1}{16} = \frac{12}{16} = \frac{3}{4} = 75\%$.
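
The two definitions above (probability of an event as a sum over its states, and certainty as probability 1) translate directly into code. The Python sketch below is illustrative only: the function names are hypothetical, and the probabilities 9/16, 4/16, 2/16, 1/16, 0, 0 are an assumption about Table 8.1, chosen to agree with the value P(F) = 75% computed in the text. Exact fractions are used to avoid rounding.

```python
from fractions import Fraction as Fr

# Ann's beliefs over states (Ann's score, Bob's score, Carla's score).
# Assumed values for Table 8.1, consistent with P(F) = 75% in the text.
P = {
    (95, 78, 54): Fr(9, 16), (95, 54, 78): Fr(4, 16),
    (78, 95, 54): Fr(2, 16), (54, 95, 78): Fr(1, 16),
    (78, 54, 95): Fr(0), (54, 78, 95): Fr(0),
}


def prob(event):
    """P(E): the sum of the probabilities of the states in E."""
    return sum((P[w] for w in event), Fr(0))


def is_certain(event):
    """Definition 8.1: the individual is certain of E if P(E) = 1."""
    return prob(event) == 1


E = {w for w in P if w[0] == 95}      # Ann received the highest score
F = {w for w in P if w[1] > w[2]}     # Bob's score was higher than Carla's
G = {w for w in P if w[2] != 95}      # Carla did not get the highest score

print(prob(E), prob(F), is_certain(G))
```

Note that `is_certain(G)` depends only on Ann's beliefs; it says nothing about whether the true state actually lies in G.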


Definition  8.1.  We  say  that  an  individual  is  certain  of  event  E if  she  attaches 
probability  1 to  E,  that  is,  if  P(E)  = 1 . 


In the above example, Ann is certain of event G = {(95,78,54), (95,54,78), (78,95,54), (54,95,78)}, corresponding to the proposition “Carla did not get the highest score”.

Remark 8.1. Note the important difference between knowledge and certainty: if at a state w the individual knows an event E ($w \in KE$) then at that state E is indeed true ($w \in E$),7 that is, it is never the case that an individual knows something which is false; on the other hand, an individual can be certain of something which is false, that is, it is possible that the individual assigns probability 1 to an event E even though the true state is not in E.8

Continuing  the  example  of  the  exam  scores,  suppose  that  Ann  has  the  beliefs 
shown  in  Table  8.1  and  now  the  professor  makes  a new  announcement:  “I  was 
surprised  to  see  that,  unlike  past  exams,  Ann  did  not  get  the  highest  score.” 
This  new  announcement  by  the  professor  informs  the  students  that  the  true 
state  is  neither  (95,78,54)  nor  (95,54,78).  Thus  we  can  view  the  effect  of  the  new 
piece  of  information  as  shrinking  Ann’s  information  set  from  {(95,78,54), 
(95,54,78),  (78,95,54),  (54,95,78),  (78,54,95),  (54,78,95)}  to  {(78,95,54),  (54,95,78), 
(78,54,95),  (54,78,95)}.  How  should  Ann  revise  her  beliefs  in  response  to  the 
new  information?  The  answer  cannot  be  that  we  simply  drop  states  (95,78,54) 
and  (95,54,78)  from  Table  8.1,  because  the  result  would  be 

State        (78,95,54)  (54,95,78)  (78,54,95)  (54,78,95)
Probability  2/16        1/16        0           0

which is not a probability distribution, since the probabilities do not add up to 1 (they add up to 3/16). The topic of belief revision is addressed in the next section.


This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this  section,  by  going  through  the  exercises  in  Section  8.E.1  of  Appendix  8.E  at  the 
end  of  this  chapter. 


7 Indeed it was proved in Exercise 7.5 (Chapter 7) that, for every event E, $KE \subseteq E$.

8 In the above example, if the true state is (78,54,95) then it does not belong to event G, representing the proposition “Carla did not get the highest score” and yet Ann assigns probability 1 to G, that is, she is certain of G.

8.2 Conditional probability, belief updating, Bayes’ rule

We  start  by  reviewing  the  notion  of  conditional  probability. 

Definition 8.2. Let W be a finite set of states and $P : W \to [0,1]$ a probability distribution on W. Let $w \in W$ be a state and $E \subseteq W$ an event such that $P(E) > 0$. Then the probability of w conditional on E (or given E), denoted by $P(w \mid E)$, is defined as

\[
P(w \mid E) =
\begin{cases}
0 & \text{if } w \notin E \\[4pt]
\dfrac{P(w)}{P(E)} & \text{if } w \in E.
\end{cases}
\]

For example, let W = {a,b,c,d,e,f,g} and let P be as shown in Table 8.2.

state        a     b     c     d     e     f     g
probability  3/20  0     7/20  1/20  0     4/20  5/20

Table 8.2

Let E = {a,d,e,g}; thus $P(E) = P(a) + P(d) + P(e) + P(g) = \frac{3}{20} + \frac{1}{20} + 0 + \frac{5}{20} = \frac{9}{20}$.

Then $P(b \mid E) = P(c \mid E) = P(f \mid E) = 0$ (since none of these states belongs to E), $P(a \mid E) = \frac{3/20}{9/20} = \frac{3}{9}$, $P(d \mid E) = \frac{1/20}{9/20} = \frac{1}{9}$, $P(e \mid E) = \frac{0}{9/20} = 0$ and $P(g \mid E) = \frac{5/20}{9/20} = \frac{5}{9}$. Thus the operation of conditioning on event E yields the updated (on E) probability distribution shown in Table 8.3.

state        a    b    c    d    e    f    g
probability  3/9  0    0    1/9  0    0    5/9

Table 8.3
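The conditioning operation of Definition 8.2 is mechanical enough to be sketched in a few lines of Python. The function name `condition_on` is our own, and the prior below uses probabilities in twentieths that we take as an assumption for illustration; exact fractions are used so that no rounding is involved.

```python
from fractions import Fraction

def condition_on(prior, event):
    """Condition `prior` (a dict state -> probability) on `event` (a set of states)."""
    p_event = sum(prior[w] for w in event)
    if p_event == 0:
        # Definition 8.2 requires P(E) > 0; otherwise we would divide by zero.
        raise ValueError("cannot condition on an event of probability zero")
    return {w: (p / p_event if w in event else Fraction(0))
            for w, p in prior.items()}

# Probabilities in twentieths for states a..g (assumed values, see the lead-in).
prior = {w: Fraction(n, 20) for w, n in zip("abcdefg", [3, 0, 7, 1, 0, 4, 5])}
E = {"a", "d", "e", "g"}
updated = condition_on(prior, E)

for w in sorted(updated):
    print(w, updated[w])
```

States outside E receive probability 0 and the remaining probabilities are rescaled by P(E), so the result is again a probability distribution.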
Philosophers and logicians have argued that, when forming new beliefs in response to new information, the rational way to proceed is to modify the initial probabilities by using the conditional probability rule. Of course, this can only be done if the new information is not surprising, in the sense that it is represented by an event E that, given the initial beliefs, has positive probability (P(E) > 0).9


Definition 8.3. We use the expression ‘belief updating’ or ‘Bayesian updating’ to refer to the modification of initial beliefs (expressed by an initial probability distribution P) obtained by applying the conditional probability rule; this assumes that the belief change is prompted by the arrival of new information, represented by an event E such that P(E) > 0.


Definition 8.4. The conditional probability formula can be extended from states to events as follows. Let A and B be two events, with P(B) > 0. Then the probability of A conditional on B (or given B), denoted by $P(A \mid B)$, is defined by

\[
P(A \mid B) = \frac{P(A \cap B)}{P(B)}.
\]

It is straightforward to check (the reader should convince himself/herself of this) that an equivalent way of defining $P(A \mid B)$ is as follows:

\[
P(A \mid B) = \sum_{w \in A \cap B} P(w \mid B).
\]


For example, consider again the initial beliefs shown in Table 8.2, which is reproduced below.

state        a     b     c     d     e     f     g
probability  3/20  0     7/20  1/20  0     4/20  5/20

As before, let E = {a,d,e,g} and consider the event D = {a,b,c,f,g}. What is the probability of D given E? First compute the intersection of D and E: $D \cap E = \{a,g\}$; thus $P(D \cap E) = P(a) + P(g) = \frac{3}{20} + \frac{5}{20} = \frac{8}{20}$. P(E) was computed before: $P(E) = P(a) + P(d) + P(e) + P(g) = \frac{3}{20} + \frac{1}{20} + 0 + \frac{5}{20} = \frac{9}{20}$. Thus, using the formula given in Definition 8.4, we obtain


9 When P(E) = 0, $P(w \mid E)$ is not well defined because division by zero has no meaning in arithmetic.
$P(D \mid E) = \frac{P(D \cap E)}{P(E)} = \frac{8/20}{9/20} = \frac{8}{9} \approx 89\%$. Thus, before receiving information E, the individual was assigning probability $P(D) = P(a) + P(b) + P(c) + P(f) + P(g) = \frac{3}{20} + 0 + \frac{7}{20} + \frac{4}{20} + \frac{5}{20} = \frac{19}{20} = 95\%$ to event D and, after receiving information E, she updates that probability to 89%.
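The two equivalent ways of computing the probability of an event conditional on another event (Definition 8.4) can be compared side by side. The following is only a sketch: the helper names `prob`, `cond_prob` and `cond_prob_statewise` are ours, and the prior (in twentieths) is taken as an assumption for illustration.

```python
from fractions import Fraction

# Assumed prior over states a..g, in twentieths.
prior = {w: Fraction(n, 20) for w, n in zip("abcdefg", [3, 0, 7, 1, 0, 4, 5])}

def prob(dist, event):
    """Probability of an event: the sum of the probabilities of its states."""
    return sum(dist[w] for w in event)

def cond_prob(dist, A, B):
    """P(A | B) = P(A ∩ B) / P(B), defined only when P(B) > 0."""
    p_B = prob(dist, B)
    if p_B == 0:
        raise ValueError("P(B) must be positive")
    return prob(dist, A & B) / p_B

def cond_prob_statewise(dist, A, B):
    """Equivalent form: the sum over w in A ∩ B of P(w | B)."""
    p_B = prob(dist, B)
    if p_B == 0:
        raise ValueError("P(B) must be positive")
    return sum(dist[w] / p_B for w in A & B)

E = {"a", "d", "e", "g"}
D = {"a", "b", "c", "f", "g"}
print(prob(prior, D), cond_prob(prior, D, E), cond_prob_statewise(prior, D, E))
```

Both functions return the same fraction, which is the content of the equivalence stated after Definition 8.4.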


According to Definition 8.4, the probability of A conditional on B is given by

\[
P(A \mid B) = \frac{P(A \cap B)}{P(B)}. \qquad (*)
\]

Similarly, the probability of B conditional on A is given by $P(B \mid A) = \frac{P(B \cap A)}{P(A)}$, which [since $A \cap B = B \cap A$ and thus $P(A \cap B) = P(B \cap A)$] can be written as

\[
P(A \cap B) = P(B \mid A)\,P(A). \qquad (\blacklozenge)
\]

From (*) and (♦) we get that $P(A \mid B) = \frac{P(B \mid A)\,P(A)}{P(B)}$, which is known as Bayes’ rule.

Definition 8.5. Let E and F be two events, with P(F) > 0. Then Bayes’ rule says that the probability of E conditional on F is equal to the probability of F conditional on E times the probability of E, divided by the probability of F:

\[
P(E \mid F) = \frac{P(F \mid E)\,P(E)}{P(F)}.
\]


As  an  example  of  the  application  of  Bayes’  rule,  suppose  that  you  are  a doctor 
examining  a middle-aged  man  who  complains  of  lower-back  pain.  You  know 
that  25%  of  men  in  your  patient’s  age  group  suffer  from  lower-back  pain.  There 
are  various  causes  of  lower  back  pain;  one  of  them  is  chronic  inflammation  of 
the  kidneys.  This  is  not  a very  common  disease:  it  affects  only  4%  of  men  in  the 
age  group  you  are  considering.  Among  those  who  suffer  from  chronic 
inflammation  of  the  kidneys,  85%  complain  of  lower-back  pain.  What  is  the 
probability  that  your  patient  has  chronic  inflammation  of  the  kidneys? 
Let I denote inflammation of the kidneys and L denote lower-back pain. The information you have is that $P(I) = \frac{4}{100} = 4\%$, $P(L) = \frac{25}{100} = 25\%$ and $P(L \mid I) = \frac{85}{100} = 85\%$. Thus, using Bayes’ rule we get that

\[
P(I \mid L) = \frac{P(L \mid I)\,P(I)}{P(L)} = \frac{\frac{85}{100} \cdot \frac{4}{100}}{\frac{25}{100}} = 0.136 = 13.6\%.
\]

This can also be seen as follows. Suppose there are 2,500 men in your patient’s age group; 4% (= 100) have inflammation of the kidneys and the remaining 96% (= 2,400) don’t:


                   Lower-back pain   No lower-back pain   Total
Inflammation                                                100
No inflammation                                           2,400
Total                                                     2,500

We  also  know  that  25%  (=  625)  of  the  population  has  lower-back  pain  and  the 
remaining  75%  (=1,875)  don’t: 


                   Lower-back pain   No lower-back pain   Total
Inflammation                                                100
No inflammation                                           2,400
Total                          625                1,875   2,500

Of  the  100  people  with  inflammation  of  the  kidneys,  85%  (=  85)  suffer  from 
lower-back  pain  and  the  remaining  15  don’t: 


                   Lower-back pain   No lower-back pain   Total
Inflammation                    85                   15     100
No inflammation                                           2,400
Total                          625                1,875   2,500

Thus we can fill in the two remaining cells (those in the “No inflammation” row) by taking the differences: 625 - 85 = 540 and 1,875 - 15 = 1,860:


                   Lower-back pain   No lower-back pain   Total
Inflammation                    85                   15     100
No inflammation                540                1,860   2,400
Total                          625                1,875   2,500

Thus the fraction of people, among those who suffer from lower-back pain, who have inflammation of the kidneys is $\frac{85}{625} = 0.136 = 13.6\%$.
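The two routes to the answer, Bayes’ rule and the head count in a population of 2,500, can be checked against each other with a short sketch. The function name `bayes` and the variable names are ours; the numbers are those of the example.

```python
from fractions import Fraction

def bayes(p_f_given_e, p_e, p_f):
    """Bayes' rule: P(E | F) = P(F | E) * P(E) / P(F), assuming P(F) > 0."""
    if p_f == 0:
        raise ValueError("P(F) must be positive")
    return p_f_given_e * p_e / p_f

p_I = Fraction(4, 100)            # inflammation of the kidneys
p_L = Fraction(25, 100)           # lower-back pain
p_L_given_I = Fraction(85, 100)   # lower-back pain among those with inflammation

p_I_given_L = bayes(p_L_given_I, p_I, p_L)

# Counting argument with a population of 2,500 men.
population = 2500
with_I = population * p_I             # men with inflammation
with_L = population * p_L             # men with lower-back pain
with_both = with_I * p_L_given_I      # men with both
fraction_by_counting = with_both / with_L

print(float(p_I_given_L), float(fraction_by_counting))
```

The size of the population is immaterial: any number would cancel out in the final ratio, and 2,500 is simply convenient because all the cell counts are whole numbers.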
So far we have focused on belief updating. The next section deals with belief revision.

This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this  section,  by  going  through  the  exercises  in  Section  8.E.2  of  Appendix  8.E  at  the 
end  of  this  chapter. 


8.3  Belief  revision 

[Note: the material of this section will not be needed until Chapter 12. It is presented here for completeness on the topic of beliefs.]

How  should  a “rational”  individual  revise  her  beliefs  when  receiving 
information  that  is  surprising,  that  is,  when  informed  of  an  event  E to  which 
her  initial  beliefs  assigned  zero  probability  (P(E)  = 0)?  The  best  known  theory 
of  “rational”  belief  revision  is  the  so-called  AGM  theory,  which  takes  its  name 
from its originators: Carlos Alchourrón (a legal scholar), Peter Gärdenfors (a
philosopher)  and  David  Makinson  (a  computer  scientist);  their  pioneering 
contribution  was  published  in  1985.  Just  like  the  theory  of  expected  utility 
(Chapter  4),  the  AGM  theory  is  an  axiomatic  theory:  it  provides  a list  of 
“rationality”  axioms  for  belief  revision  and  provides  representation  theorems.10 
Although  the  AGM  theory  was  developed  within  the  language  of  propositional 
logic,  it  can  be  restated  in  terms  of  a set  of  states  and  a collection  of  possible 
items  of  information  represented  as  events.  We  first  introduce  the  non- 
probabilistic  version  of  the  theory  and  then  add  graded  beliefs,  that  is, 
probabilities. 

Let W be a finite set of states and $\mathcal{E} \subseteq 2^W$ a collection of events (subsets of W) representing possible items of information; we assume that $W \in \mathcal{E}$ and $\emptyset \notin \mathcal{E}$. To represent initial beliefs and revised beliefs we introduce a function $f : \mathcal{E} \to 2^W$, which we call a belief revision function.


10 We will not list and discuss the axioms here. The interested reader can consult http://plato.stanford.edu/entries/formal-belief/ or, for a discussion which is closer to the approach followed in this section, see Bonanno (2009).
Definition 8.6. Let W be a finite set of states and $\mathcal{E}$ a collection of events such that $W \in \mathcal{E}$ and $\emptyset \notin \mathcal{E}$. A belief revision function is a function $f : \mathcal{E} \to 2^W$ that satisfies the following properties: for every $E \in \mathcal{E}$, (1) $f(E) \subseteq E$ and (2) $f(E) \neq \emptyset$.

The interpretation of a belief revision function is as follows. First of all, f(W) represents the initial beliefs, namely the set of states that the individual initially considers possible.11 Secondly, for every $E \in \mathcal{E}$, f(E) is the set of states that the individual would consider possible if informed that the true state belongs to E; thus f(E) represents the individual’s revised beliefs after receiving information E.12

One of the implications of the AGM axioms for belief revision is the following condition, which is known as Arrow’s Axiom (proposed by the Nobel laureate Ken Arrow in the context of rational choice, rather than rational belief revision):

if $E, F \in \mathcal{E}$, $E \subseteq F$ and $E \cap f(F) \neq \emptyset$ then $f(E) = E \cap f(F)$.

Arrow’s Axiom says that if information E implies information F ($E \subseteq F$) and there are states in E that would be considered possible upon receiving information F ($E \cap f(F) \neq \emptyset$), then the states that the individual would consider possible if informed that E are precisely those that belong to both E and f(F) ($f(E) = E \cap f(F)$). Although necessary for a belief revision policy that satisfies the AGM axioms, Arrow’s Axiom is not sufficient. Before stating the necessary and sufficient condition for rational belief revision, we remind the reader of the notion of a complete and transitive relation on a set W


11 If the initial beliefs were to be expressed probabilistically, by means of a probability distribution P over W, then f(W) would be the support of P, that is, the set of states to which P assigns positive probability. Thus f(W) would be the smallest event of which the individual would initially be certain: she would initially be certain of any event F such that $f(W) \subseteq F$.

12 If the revised beliefs after receiving information E were to be expressed probabilistically, by means of a probability distribution $P_E$ over W, then f(E) would be the support of $P_E$, that is, the set of states to which $P_E$ assigns positive probability. Thus f(E) would be the smallest event of which the individual would be certain after having been informed that E: according to her revised beliefs she would be certain of any event F such that $f(E) \subseteq F$. [Note that, since by assumption $f(E) \subseteq E$, the individual is assumed to be certain of the information received (e.g. because she trusts the source of the information).]
(Remark 1.1, Chapter 1). In Chapter 1 the relation was denoted by $\succsim$ and was interpreted in terms of preference: $o \succsim o'$ was interpreted as “the individual considers outcome o to be at least as good as outcome o'”. In the present context the relation will be denoted by $\precsim$ and the interpretation is in terms of “plausibility”: $w \precsim w'$ is to be read as “the individual considers state w to be at least as plausible as state w'” ($w \prec w'$ means that w is considered to be more plausible than w' and $w \sim w'$ means that w is considered to be just as plausible as w').13 An alternative reading of $w \precsim w'$ is “w weakly precedes w' in terms of plausibility”.

Definition 8.7. A plausibility order on a set of states W is a binary relation $\precsim$ on W that is complete (for every two states $w_1$ and $w_2$, either $w_1 \precsim w_2$ or $w_2 \precsim w_1$ or both) and transitive (if $w_1 \precsim w_2$ and $w_2 \precsim w_3$ then $w_1 \precsim w_3$). We define $w \prec w'$ as “$w \precsim w'$ and $w' \not\precsim w$” and $w \sim w'$ as “$w \precsim w'$ and $w' \precsim w$”.

Theorem 8.1 (Grove, 1988; Bonanno, 2009). Let W be a finite set of states, $\mathcal{E}$ a collection of events (representing possible items of information), with $W \in \mathcal{E}$ and $\emptyset \notin \mathcal{E}$, and $f : \mathcal{E} \to 2^W$ a belief revision function (Definition 8.6). Then the belief revision policy represented by the function f is compatible with the AGM axioms of belief revision if and only if there exists a plausibility order $\precsim$ on W that rationalizes f in the sense that, for every $E \in \mathcal{E}$, f(E) is the set of most plausible states in E: $f(E) = \{w \in E : w \precsim w' \text{ for every } w' \in E\}$.

Definition 8.8. A belief revision function $f : \mathcal{E} \to 2^W$ which is rationalized by a plausibility order is called an AGM belief revision function.

Remark 8.2. An AGM belief revision function satisfies Arrow’s Axiom (the reader is asked to prove this in Exercise 8.6). The converse is not true: it is possible for a belief revision function $f : \mathcal{E} \to 2^W$ to satisfy Arrow’s Axiom and yet fail to be rationalized by a plausibility order.
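To make the notion of an AGM belief revision function concrete, the following sketch builds one from a plausibility order and checks Arrow’s Axiom on it. Everything in it is illustrative: the four states, the numerical ranks (lower rank means more plausible) and the function names `revise` and `satisfies_arrow` are our own assumptions, and the collection of events is taken to be all non-empty subsets of W.

```python
from itertools import combinations

def revise(rank, event):
    """f(E): the most plausible states in E, i.e. those of lowest rank."""
    best = min(rank[w] for w in event)
    return frozenset(w for w in event if rank[w] == best)

def satisfies_arrow(f):
    """Arrow's Axiom: if E ⊆ F and E ∩ f(F) ≠ ∅ then f(E) = E ∩ f(F)."""
    for E in f:
        for F in f:
            if E <= F and (E & f[F]) and f[E] != E & f[F]:
                return False
    return True

# Hypothetical plausibility order: a ~ b, both more plausible than c, then d.
rank = {"a": 0, "b": 0, "c": 1, "d": 2}
W = frozenset(rank)

# All non-empty subsets of W as possible items of information.
events = [frozenset(c) for r in range(1, len(W) + 1)
          for c in combinations(sorted(W), r)]
f = {E: revise(rank, E) for E in events}

print(sorted(f[W]))                     # initial beliefs f(W)
print(sorted(f[frozenset({"c", "d"})])) # beliefs after surprising information
print(satisfies_arrow(f))
```

Since the ranks induce a complete and transitive order, the resulting function is rationalized by a plausibility order, so by Remark 8.2 the check of Arrow’s Axiom is expected to succeed.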


13 One reason for the reversal of notation is that, in this context, when representing the order $\precsim$ numerically, it will be convenient to assign lower values to more plausible histories.
Another  reason  is  that  it  is  the  standard  notation  in  the  extensive  literature  that  deals  with 
AGM  belief  revision  (for  a recent  survey  of  this  literature  the  reader  is  referred  to  the  special 
issue  of  the  Journal  of  Philosophical  Logic,  Vol.  40,  Issue  2,  April  2011). 
Within the context of probabilistic beliefs, let P be the probability distribution on the set of states W that represents the initial beliefs and $P_E$ be the probability distribution representing the updated beliefs after receiving not surprising information E (not surprising in the sense that P(E) > 0). The support of a probability distribution P, denoted by Supp(P), is the set of states to which P assigns positive probability: $\mathrm{Supp}(P) = \{w \in W : P(w) > 0\}$. The rule for updating beliefs upon receiving information E (Definition 8.2) implies the following:

if $E \cap \mathrm{Supp}(P) \neq \emptyset$ (that is, if P(E) > 0) then $\mathrm{Supp}(P_E) = E \cap \mathrm{Supp}(P)$.

We  call  this  the  qualitative  belief  updating  rule.  It  is  easy  to  check  that  the 
qualitative  belief  updating  rule  is  implied  by  Arrow’s  Axiom.14  Thus,  by 
Remark  8.2,  an  AGM  belief  revision  function  has  incorporated  in  it  the 
qualitative  belief  updating  rule.  In  other  words,  belief  updating  is  included  in  the 
notion  of  AGM  belief  revision.  A belief  revision  function,  however,  goes  beyond 
belief  updating  because  it  also  encodes  new  beliefs  after  receipt  of  surprising 
information  (that  is,  after  being  informed  of  an  event  E such  that  P(E)  = 0). 

What is the probabilistic version of AGM belief revision? It turns out that in order to obtain probabilistic beliefs we only need to make a simple addition to an AGM belief revision function $f : \mathcal{E} \to 2^W$. Let $P_0$ be any full-support probability distribution on W (that is, $P_0(w) > 0$, for every $w \in W$). Then, for every $E \in \mathcal{E}$ let $P_E$ be the probability distribution obtained by conditioning $P_0$ on f(E):

\[
P_E(w) = P_0(w \mid f(E)) =
\begin{cases}
\dfrac{P_0(w)}{\sum_{w' \in f(E)} P_0(w')} & \text{if } w \in f(E) \\[8pt]
0 & \text{if } w \notin f(E).
\end{cases}
\]

Then $P_W$ gives the initial probabilistic beliefs and, for every other $E \in \mathcal{E}$, $P_E$ gives the revised probabilistic beliefs after receiving information E. The


14 For every event E representing a possible item of information, let $P_E$ be the probability distribution on E representing the revised beliefs of the individual after receiving information E. Let P be the probability distribution on W representing the individual’s initial beliefs. Define the following belief revision function f: $f(W) = \mathrm{Supp}(P)$ and $f(E) = \mathrm{Supp}(P_E)$. Suppose that f satisfies Arrow’s Axiom. Then, for every event E, if $E \cap f(W) \neq \emptyset$ [that is, if $E \cap \mathrm{Supp}(P) \neq \emptyset$] then $f(E) = E \cap f(W)$ [that is, $\mathrm{Supp}(P_E) = E \cap \mathrm{Supp}(P)$].
collection $\{P_E\}_{E \in \mathcal{E}}$ of probability distributions on W so obtained gives the individual’s probabilistic belief revision policy (while the function $f : \mathcal{E} \to 2^W$ gives the qualitative probabilistic belief revision policy).
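The construction just described, conditioning a full-support distribution on f(E), can be sketched as follows. The ranks, the distribution `p0` and the function names are hypothetical choices of ours, made only to show how surprising and unsurprising information are handled by the same recipe.

```python
from fractions import Fraction

def revise(rank, event):
    """f(E): the most plausible (lowest-rank) states in E."""
    best = min(rank[w] for w in event)
    return frozenset(w for w in event if rank[w] == best)

def condition_on(dist, support):
    """Condition `dist` on the set `support` (assumed to have positive probability)."""
    total = sum(dist[w] for w in support)
    return {w: (dist[w] / total if w in support else Fraction(0)) for w in dist}

# Hypothetical plausibility ranks and full-support distribution P0.
rank = {"a": 0, "b": 0, "c": 1, "d": 2}
p0 = {"a": Fraction(1, 10), "b": Fraction(2, 10),
      "c": Fraction(3, 10), "d": Fraction(4, 10)}
W = frozenset(rank)

def revised_beliefs(event):
    """P_E: P0 conditioned on f(E)."""
    return condition_on(p0, revise(rank, event))

P_W = revised_beliefs(W)                            # initial beliefs
P_surprise = revised_beliefs(frozenset({"c", "d"})) # P_W({c,d}) = 0
P_expected = revised_beliefs(frozenset({"b", "c", "d"}))

print(P_W, P_surprise, P_expected, sep="\n")
```

In this sketch the information {b,c,d} is not surprising (it has probability 2/3 under the initial beliefs), and the revised beliefs coincide with what Bayesian updating of the initial beliefs would give; the information {c,d} is surprising, and the plausibility order is what determines the new beliefs.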

Definition 8.9. Let W be a finite set of states and $\mathcal{E}$ a collection of events such that $W \in \mathcal{E}$ and $\emptyset \notin \mathcal{E}$. A probabilistic belief revision policy is a collection $\{P_E\}_{E \in \mathcal{E}}$ of probability distributions on W such that, for every $E \in \mathcal{E}$, $\mathrm{Supp}(P_E) \subseteq E$. $P_W$ represents the initial beliefs and, for every other $E \in \mathcal{E}$, $P_E$ represents the revised beliefs after receiving information E.

The collection $\{P_E\}_{E \in \mathcal{E}}$ is called an AGM probabilistic belief revision policy if it satisfies the following properties:

(1) there exists a plausibility order $\precsim$ on W such that, for every $E \in \mathcal{E}$, $\mathrm{Supp}(P_E)$ is the set of most plausible states in E, that is, $\mathrm{Supp}(P_E) = \{w \in E : w \precsim w' \text{ for every } w' \in E\}$,15 and

(2) there exists a full-support probability distribution $P_0$ on W such that, for every $E \in \mathcal{E}$, $P_E$ is equal to the probability distribution obtained by conditioning $P_0$ on $\mathrm{Supp}(P_E)$.

As  we  will  see  in  Part  IV,  belief  revision  is  very  important  in  dynamic  (or 
extensive-form)  games.  In  such  games  a player  may  find  herself  at  an 
information  set  that,  according  to  her  initial  beliefs,  had  zero  probability  of 
being  reached  and  thus  will  have  to  form  new  beliefs  reflecting  the  unexpected 
information. 

This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this  section,  by  going  through  the  exercises  in  Section  8.E.3  of  Appendix  8.E  at  the 
end  of  this  chapter. 


15 This condition says that if one defines the function $f : \mathcal{E} \to 2^W$ by $f(E) = \mathrm{Supp}(P_E)$ then this function f is an AGM belief revision function.
8.4 Harsanyi consistency of beliefs or like-mindedness

[Note:  the  material  of  this  section  will  not  be  needed  until  Chapter  13.  It  is 
presented  here  for  completeness  on  the  topic  of  beliefs.] 

We  can  easily  extend  the  analysis  to  the  case  of  two  or  more  individuals.  We 
already  know  how  to  model  interactive  knowledge  by  means  of  information 
partitions;  the  addition  of  beliefs  is  a simple  step:  we  merely  add,  for  every 
individual  and  for  every  information  set,  a probability  distribution  over  the 
elements  of  that  information  set.  A two-person  example  is  shown  in  Figure  8.4 
below. 


Figure  8.4 

In this example, at every state, the two individuals hold different beliefs. For example, consider event E = {b,c} and state a. Individual 1 attaches probability 1/2 to E while Individual 2 attaches probability 1/3 to E. Can two “equally rational” individuals hold different beliefs? The answer is: of course! In the above example it is not surprising that the two individuals assign different probabilities to the same event E, because they have different information. If the true state is a, then Individual 1’s information is that the true state is either a or b or c, while Individual 2 considers only a and b possible (Individual 2 knows more than Individual 1). Is there a precise way of expressing the fact that two individuals assign different probabilities to an event precisely because they have different information? In the above example we could ask the hypothetical question: if Individual 1 had the same information as Individual 2, would he agree with Individual 2’s assessment that the probability of event E = {b,c} is 1/3? This is, of course, a counterfactual question. The answer to this counterfactual question is affirmative: imagine giving Individual 1 the information that the true state is either a or b; then, according to Definition 8.3, he would update his beliefs from

\[
\begin{pmatrix} a & b & c \\ \frac{1}{2} & \frac{1}{4} & \frac{1}{4} \end{pmatrix}
\quad \text{to} \quad
\begin{pmatrix} a & b \\ \frac{2}{3} & \frac{1}{3} \end{pmatrix}
\]

and thus have the same beliefs as Individual 2.


We  say  that  two  individuals  are  like-minded  if  it  is  the  case  that  they  would 
have  the  same  beliefs  if  they  had  the  same  information.  It  is  not  straightforward 
how  to  turn  this  into  a precise  definition.  Consider,  again,  the  example  of 
Figure  8.4  and  event  E = {b,c}.  Above  we  asked  the  question  “what  would 
Individual  1 believe  if  he  knew  as  much  as  Individual  2?”.  This  is  a simple 
question  because  we  can  imagine  giving  more  information  to  Individual  1 and 
have  him  update  his  beliefs  based  on  that  information.  However,  we  could  also 
have  asked  the  question  “what  would  Individual  2 believe  if  he  knew  as  little  as 
Individual  1?”.  In  this  case  we  would  have  to  imagine  “taking  away 
information”  from  Individual  2,  by  increasing  his  information  set  from  {a,b}  to 
{a,b,c}.  This  is  not  the  same  as  asking  Individual  2 to  update  his  beliefs  based  on 
information  {a,b,c},  because  updating  on  something  you  already  know  leaves 
your  beliefs  unchanged. 

There  is  a sense  in  which  the  beliefs  of  the  two  individuals  of  Figure  8.4  are  “in 
agreement”:  for  example,  they  both  consider  state  a twice  as  likely  as  state  b. 
One  could  try  to  use  this  condition  to  define  like-mindedness:  for  every  two 
states  x and  y,  whenever  two  individuals  consider  both  x and  y as  possible 
(given  their  information)  then  they  agree  on  the  relative  likelihood  of  x versus 
y.  Unfortunately,  this  condition  is  too  weak.  To  see  this,  consider  the  three- 
individual  example  of  Figure  8.5. 


Individual 1:  {a}   {b,c}
Individual 2:  {a,b}   {c}
Individual 3:  {a,c}   {b}

Figure 8.5


No matter what the true state is, we cannot find two individuals and two states x and y that both individuals consider possible. Thus any beliefs would make the three individuals like-minded. For example, Individual 1 at his information set {b,c} could assign equal probability to b and c, Individual 2 at her information set {a,b} could assign probability 3/4 to a and 1/4 to b and Individual 3 at his information set {a,c} could assign probability 1/4 to a and 3/4 to c. But then, putting together the beliefs of Individuals 1 and 2 we would have that a is judged to be three times more likely than c (according to Individual 2, a is three times more likely than b and, according to Individual 1, b is just as likely as c), while Individual 3 has the opposite judgment that c is three times more likely than a.

In order to give a precise definition of like-mindedness we need to introduce some notation. Let there be n individuals ($n \geq 2$). Let W be a set of states and let $\mathcal{I}_i$ be the information partition of individual i ($i \in \{1, \ldots, n\}$). As usual, if w is a state, we denote by $I_i(w)$ the element (information set) of the partition $\mathcal{I}_i$ that contains w. Let $P_{i,w}$ be the beliefs of individual i at state w, that is, $P_{i,w}$ is a probability distribution over $I_i(w)$. Clearly, we can think of $P_{i,w}$ as a probability distribution over the entire set of states W satisfying the property that if $w' \notin I_i(w)$ then $P_{i,w}(w') = 0$. Note that, for every individual i, there is just one probability distribution over $I_i(w)$ and thus if $w' \in I_i(w)$ then $P_{i,w'} = P_{i,w}$.

Definition 8.10. A probability distribution P over W is called a common prior if, for every individual i and for every state w, (1) $P(I_i(w)) > 0$ and (2) updating P on $I_i(w)$ (see Definition 8.4) yields precisely $P_{i,w}$, that is, $P(w' \mid I_i(w)) = P_{i,w}(w')$, for every $w' \in W$.

When a common prior exists we say that the individuals are like-minded or that the individuals’ beliefs are Harsanyi consistent.16


16 John Harsanyi, who in 1994 won the Nobel prize in Economics (together with Reinhard
Selten  and  John  Nash),  introduced  the  theory  of  games  of  incomplete  information  which  will 
be  the  object  of  Part  V.  In  that  theory  the  notion  of  Harsanyi  consistency  plays  a crucial 
role. 
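For instance, in the example of Figure 8.4, reproduced below, a common prior exists and thus the two individuals are like-minded.

[Figure 8.4, reproduced: each individual’s information sets, with the probability attached to each state written next to it.]

Individual 1:  {a: 1/2, b: 1/4, c: 1/4}   {d: 1/2, e: 1/2}
Individual 2:  {a: 2/3, b: 1/3}   {c: 1/3, d: 2/3}   {e: 1}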

A common prior is

\[
\begin{pmatrix} a & b & c & d & e \\ \frac{2}{8} & \frac{1}{8} & \frac{1}{8} & \frac{2}{8} & \frac{2}{8} \end{pmatrix}
\]

(the reader should convince himself/herself that, indeed, updating this probability distribution on each information set yields the probability distribution written inside that information set).
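The verification left to the reader can also be carried out mechanically. The sketch below checks the two conditions of Definition 8.10 for a candidate prior; the function name `is_common_prior` and the data layout are our own, and the beliefs at each information set are entered as implied by the updating conditions discussed in this section (Individual 2’s singleton information set {e} necessarily carries probability 1).

```python
from fractions import Fraction as F

# Beliefs at each information set: individual -> {information set: local beliefs}.
beliefs = {
    1: {frozenset("abc"): {"a": F(1, 2), "b": F(1, 4), "c": F(1, 4)},
        frozenset("de"): {"d": F(1, 2), "e": F(1, 2)}},
    2: {frozenset("ab"): {"a": F(2, 3), "b": F(1, 3)},
        frozenset("cd"): {"c": F(1, 3), "d": F(2, 3)},
        frozenset("e"): {"e": F(1)}},
}

def is_common_prior(prior, beliefs):
    """Definition 8.10: every information set has positive prior probability and
    updating the prior on it reproduces the beliefs held at that set."""
    for partition in beliefs.values():
        for info_set, local in partition.items():
            mass = sum(prior[w] for w in info_set)
            if mass == 0:
                return False
            if any(prior[w] / mass != local[w] for w in info_set):
                return False
    return True

candidate = {"a": F(2, 8), "b": F(1, 8), "c": F(1, 8), "d": F(2, 8), "e": F(2, 8)}
uniform = {w: F(1, 5) for w in "abcde"}

print(is_common_prior(candidate, beliefs), is_common_prior(uniform, beliefs))
```

The uniform distribution is included only as a contrast: it fails, for example, on Individual 1’s information set {a,b,c}, where it would yield probability 1/3 for each state.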

How can we determine if a common prior exists? The issue of existence of a common prior can be reduced to the issue of whether a system of equations has a solution. To see this, let us go back to the example of Figure 8.4, reproduced above. A common prior would be a probability distribution

\[
\begin{pmatrix} a & b & c & d & e \\ p_a & p_b & p_c & p_d & p_e \end{pmatrix}
\]

that satisfies the following conditions.


1. Updating on information set {a,b,c} of Individual 1 we need $\frac{p_b}{p_a + p_b + p_c} = \frac{1}{4}$ and $\frac{p_c}{p_a + p_b + p_c} = \frac{1}{4}$. Note that from these two the third condition follows, namely $\frac{p_a}{p_a + p_b + p_c} = \frac{1}{2}$.

2. Updating on information set {d,e} of Individual 1 we need $\frac{p_d}{p_d + p_e} = \frac{1}{2}$, from which it follows that $\frac{p_e}{p_d + p_e} = \frac{1}{2}$.

3. Updating on information set {a,b} of Individual 2 we need $\frac{p_a}{p_a + p_b} = \frac{2}{3}$, from which it follows that $\frac{p_b}{p_a + p_b} = \frac{1}{3}$.

4. Updating on information set {c,d} of Individual 2 we need $\frac{p_c}{p_c + p_d} = \frac{1}{3}$, from which it follows that $\frac{p_d}{p_c + p_d} = \frac{2}{3}$.


From the first we get $p_b = p_c$, from the second $p_d = p_e$, from the third $p_a = 2p_b$ and from the fourth $p_d = 2p_c$. Adding to these four equalities the requirement that $p_a + p_b + p_c + p_d + p_e = 1$, we have a system of five equations in five unknowns, which admits a unique solution, namely

\[
\begin{pmatrix} a & b & c & d & e \\ \frac{2}{8} & \frac{1}{8} & \frac{1}{8} & \frac{2}{8} & \frac{2}{8} \end{pmatrix}.
\]
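The system of five equations can also be solved by machine. The sketch below uses a small Gauss-Jordan elimination over exact fractions; the solver `solve` is our own helper (not a library routine), written under the assumption that the system has a unique solution.

```python
from fractions import Fraction

def solve(A, b):
    """Solve the square linear system A x = b exactly by Gauss-Jordan elimination."""
    n = len(A)
    M = [[Fraction(x) for x in row] + [Fraction(y)] for row, y in zip(A, b)]
    for col in range(n):
        # Find a row at or below the diagonal with a non-zero entry in this column.
        pivot = next((r for r in range(col, n) if M[r][col] != 0), None)
        if pivot is None:
            raise ValueError("the system does not have a unique solution")
        M[col], M[pivot] = M[pivot], M[col]
        pivot_value = M[col][col]
        M[col] = [x / pivot_value for x in M[col]]
        # Eliminate this column from every other row.
        for r in range(n):
            if r != col and M[r][col] != 0:
                factor = M[r][col]
                M[r] = [x - factor * y for x, y in zip(M[r], M[col])]
    return [row[-1] for row in M]

# Unknowns in the order p_a, p_b, p_c, p_d, p_e.
A = [
    [0, 1, -1, 0, 0],   # p_b = p_c
    [0, 0, 0, 1, -1],   # p_d = p_e
    [1, -2, 0, 0, 0],   # p_a = 2 p_b
    [0, 0, -2, 1, 0],   # p_d = 2 p_c
    [1, 1, 1, 1, 1],    # probabilities add up to 1
]
b = [0, 0, 0, 0, 1]

solution = solve(A, b)
print(dict(zip("abcde", solution)))
```

The fractions are printed in lowest terms, so 2/8 appears as 1/4.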


It  is  not  always  the  case  that  a common  prior  exists.  For  instance,  if  we  add  to 
the  example  of  Figure  8.5  the  beliefs  shown  in  Figure  8.6,  then  we  get  a 
situation  where  the  individuals’  beliefs  are  not  Harsanyi  consistent. 

[Figure 8.6: the information partitions of Figure 8.5, with a probability distribution attached to each information set.]

Figure 8.6


In this case, from the updating conditions of Individuals 1 and 2 one would get that $p_a = p_b$ and $p_b = p_c$, from which it follows that $p_a = p_c$; however, from the updating condition for Individual 3 we get that $p_a = 3p_c$, yielding a contradiction.
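The search for a contradiction can be automated by propagating likelihood ratios from one state to the next through the information sets. The following is a sketch under two assumptions of ours: all beliefs are strictly positive on their information sets, and the beliefs entered for the three-individual example are those implied by the conditions $p_a = p_b$, $p_b = p_c$ and $p_a = 3p_c$ stated in the text. The function name `find_common_prior` is hypothetical.

```python
from fractions import Fraction as F

def find_common_prior(states, info_sets):
    """Return a common prior, or None if the ratio conditions are contradictory.

    `info_sets` lists, for all individuals together, the beliefs held at each
    information set (dict state -> strictly positive probability)."""
    weight = {}
    for start in states:
        if start in weight:
            continue
        weight[start] = F(1)       # free scale for this connected component
        frontier = [start]
        while frontier:
            x = frontier.pop()
            for q in info_sets:
                if x not in q:
                    continue
                for y in q:
                    implied = weight[x] * q[y] / q[x]  # p_y / p_x must equal q_y / q_x
                    if y not in weight:
                        weight[y] = implied
                        frontier.append(y)
                    elif weight[y] != implied:
                        return None    # two chains of ratios disagree
    total = sum(weight.values())
    return {w: weight[w] / total for w in states}

three_individuals = [
    {"a": F(1)}, {"b": F(1, 2), "c": F(1, 2)},        # Individual 1
    {"a": F(1, 2), "b": F(1, 2)}, {"c": F(1)},        # Individual 2
    {"a": F(3, 4), "c": F(1, 4)}, {"b": F(1)},        # Individual 3
]
two_individuals = [
    {"a": F(1, 2), "b": F(1, 4), "c": F(1, 4)}, {"d": F(1, 2), "e": F(1, 2)},
    {"a": F(2, 3), "b": F(1, 3)}, {"c": F(1, 3), "d": F(2, 3)}, {"e": F(1)},
]

print(find_common_prior(list("abc"), three_individuals))
print(find_common_prior(list("abcde"), two_individuals))
```

For the three-individual beliefs the function reports that no common prior exists, while for the two-individual example of Figure 8.4 it recovers the common prior found above.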
This is a good time to test your understanding of the concepts introduced in this section, by going through the exercises in Section 8.E.4 of Appendix 8.E at the end of this chapter.


8.5  Agreeing  to  disagree 

Can  two  rational  and  like-minded  individuals  agree  to  disagree?  This  question 
was  raised  in  1976  by  Robert  Aumann  (who  received  the  Nobel  Prize  in 
Economics  in  2005,  together  with  Thomas  Schelling). 

As remarked above, it is certainly quite possible for two rational individuals to have different beliefs about a particular event, because they might have different information. Let us go back to the example of Figure 8.4, reproduced below in Figure 8.7, together with the common prior (showing that the individuals are like-minded). Suppose that the true state is a and consider what the two individuals believe about event E = {b,c}. Individual 1’s information set is {a,b,c} and, given his beliefs at that information set, he attaches probability $\frac{1}{4} + \frac{1}{4} = \frac{1}{2}$ to E; let us denote this by “at state a, $P_1(E) = \frac{1}{2}$”. Individual 2’s information set is {a,b} and, given her beliefs at that information set, she attaches probability $\frac{1}{3}$ to E: let us denote this by “at state a, $P_2(E) = \frac{1}{3}$”. Thus the two individuals disagree about the probability of event E. Furthermore, they know that they disagree. To see this, let $\|P_1(E) = \frac{1}{2}\|$ be the event that Individual 1 attaches probability $\frac{1}{2}$ to E; then $\|P_1(E) = \frac{1}{2}\| = \{a,b,c\}$. Similarly, let $\|P_2(E) = \frac{1}{3}\|$ be the event that Individual 2 attaches probability $\frac{1}{3}$ to E; then $\|P_2(E) = \frac{1}{3}\| = \{a,b,c,d\}$. These are events and thus we can check at what states the two individuals know them. Using Definition 7.3 (Chapter 7), we have that $K_1 \|P_2(E) = \frac{1}{3}\| = \{a,b,c\}$ and $K_2 \|P_1(E) = \frac{1}{2}\| = \{a,b\}$. Hence

\[
a \in \|P_1(E) = \tfrac{1}{2}\| \cap \|P_2(E) = \tfrac{1}{3}\| \cap K_1 \|P_2(E) = \tfrac{1}{3}\| \cap K_2 \|P_1(E) = \tfrac{1}{2}\|.
\]
[Figure 8.7: the structure of Figure 8.4 together with the common prior (a: 2/8, b: 1/8, c: 1/8, d: 2/8, e: 2/8), annotated with each individual’s probability of E = {b,c}. Individual 1: $P_1(E) = \frac{1}{2}$ on {a,b,c} and $P_1(E) = 0$ on {d,e}. Individual 2: $P_2(E) = \frac{1}{3}$ on {a,b}, $P_2(E) = \frac{1}{3}$ on {c,d} and $P_2(E) = 0$ on {e}.]

Figure 8.7

Thus at state a not only do the individuals disagree, but they know that they
disagree. However, their disagreement is not common knowledge. Indeed,
$K_1K_2\|P_1(E) = \tfrac{1}{2}\| = \emptyset$ and thus $a \notin K_1K_2\|P_1(E) = \tfrac{1}{2}\|$, that is, at state a it is
not the case that Individual 1 knows that Individual 2 knows that Individual 1
assigns probability $\tfrac{1}{2}$ to event E. As the following theorem states, the opinions
of two like-minded individuals about an event E cannot be in disagreement and,
at the same time, commonly known.
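The knowledge claims in this example can be checked mechanically. The following is a minimal Python sketch, not part of the original text. It assumes that the two information partitions are {a,b,c}, {d,e} for Individual 1 and {a,b}, {c,d}, {e} for Individual 2, which is what the events and knowledge sets stated above imply; the names `K`, `partition_1`, `partition_2`, `event_p1` and `event_p2` are illustrative only.

```python
# Information partitions (an assumption of this sketch, inferred from the
# events and knowledge sets stated in the text).
partition_1 = [{"a", "b", "c"}, {"d", "e"}]
partition_2 = [{"a", "b"}, {"c", "d"}, {"e"}]


def K(partition, event):
    """Return the set of states at which the individual knows `event`,
    that is, the union of the information sets contained in `event`."""
    known = set()
    for cell in partition:
        if cell <= event:
            known |= cell
    return known


# Event that Individual 1 holds his stated probability of E.
event_p1 = {"a", "b", "c"}
# Event that Individual 2 holds her stated probability of E.
event_p2 = {"a", "b", "c", "d"}

k1_of_p2 = K(partition_1, event_p2)       # 1 knows 2's assessment
k2_of_p1 = K(partition_2, event_p1)       # 2 knows 1's assessment
k1_k2_of_p1 = K(partition_1, k2_of_p1)    # 1 knows that 2 knows 1's assessment

print(sorted(k1_of_p2), sorted(k2_of_p1), sorted(k1_k2_of_p1))
```

The last set is empty, which is the sense in which the disagreement is known by both individuals at state a but is not common knowledge.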

The following theorem is proved in Appendix 8.A at the end of this chapter.

Theorem 8.1 (Agreement Theorem; Aumann, 1976). Let W be a set of states
and consider a knowledge-belief structure with two individuals, 1 and 2. Let E
be an event and let $p, q \in [0,1]$. Suppose that at some state w it is common
knowledge that Individual 1 assigns probability p to E and Individual 2 assigns
probability q to E. Then, if the individuals’ beliefs are Harsanyi consistent
(Definition 8.10), p = q. In other words, two like-minded individuals cannot
agree to disagree about the probability of an event. Formally, if there exists a
common prior and $w \in CK\left(\|P_1(E) = p\| \cap \|P_2(E) = q\|\right)$ then p = q.

Another way to think of this result is to imagine that the two individuals
communicate their opinions to each other. Hearing that the other person has a
different opinion is in itself a valuable piece of information, which ought to be
incorporated (by updating) into one’s beliefs. Thus sequential communication
leads to changing beliefs. If, at the end of this process, the beliefs become
common knowledge, then they must be identical. We shall illustrate this with
an example.

Imagine  two  scientists  who  agree  that  the  laws  of  Nature  are  such  that  the  true 
state  of  the  world  must  be  one  of  seven,  call  them  a,  b,  c,  d,  e,  f,  g.  They  also 
agree  on  the  relative  likelihood  of  these  possibilities,  which  they  take  to  be  as 
shown  in  Figure  8.8. 


state        a     b     c     d     e     f     g
probability  4/32  2/32  8/32  5/32  7/32  2/32  4/32

Figure 8.8


Experiments can be conducted to learn more. An experiment leads to a
partition of the set of states. For example, if the true state is a and you perform
an experiment then you might learn that the true state cannot be d or e or f or g
but you still would not know which is the true state among the remaining
ones. Suppose that the scientists agree that Scientist 1, from now on denoted by
S1, will perform experiment 1 and Scientist 2, denoted by S2, will perform
experiment 2. They also agree that each experiment would lead to a partition of
the set of states as shown in Figure 8.9.

Experiment 1:  {a,b,c}   {d,e,f}   {g}

Experiment 2:  {a,b,d}   {c,e,f,g}

Figure 8.9


Suppose that the scientists are interested in establishing the truth of a
proposition that is represented by the event E = {a, c, d, e}. Initially (given the
shared probabilistic beliefs shown in Figure 8.8) they agree that the probability
that E is true is 75%:

$P(E) = P(a) + P(c) + P(d) + P(e) = \tfrac{4}{32} + \tfrac{8}{32} + \tfrac{5}{32} + \tfrac{7}{32} = \tfrac{24}{32} = \tfrac{3}{4} = 75\%.$

Before they perform the experiments they also realize that, depending on what
the true state is, after the experiment they will have an updated probability of
event E conditional on what the experiment has revealed. For example, they
agree that if one performs experiment 1 and the true state is b then the
experiment will yield the information F = {a,b,c} and P(E | F) is given by

$P(E \mid F) = \frac{P(E \cap F)}{P(F)} = \frac{P(a) + P(c)}{P(a) + P(b) + P(c)} = \frac{\tfrac{4}{32} + \tfrac{8}{32}}{\tfrac{4}{32} + \tfrac{2}{32} + \tfrac{8}{32}} = \frac{12}{14} = \frac{6}{7} \approx 86\%.$

[Note the interesting fact that sometimes experiments, although they are
informative (that is, they reduce one’s state of uncertainty), might actually
induce one to become more confident of the truth of something that is false: in
this case one would increase one’s subjective probability that E is true from
75% to 86%, although E is actually false if the true state is b, as we
hypothesized.]

We  can  associate  with  every  cell  of  each  experiment  (that  is,  with  every  possible 
state  of  information  yielded  by  the  experiment)  a new  updated  probability  of 
event  E,  as  shown  in  Figure  8.10. 

Experiment 1:  {a,b,c}: Prob(E) = 12/14   {d,e,f}: Prob(E) = 12/14   {g}: Prob(E) = 0

Experiment 2:  {a,b,d}: Prob(E) = 9/11   {c,e,f,g}: Prob(E) = 15/21

Figure 8.10
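The updated probabilities attached to each cell can be reproduced with a few lines of Python. This is only an illustrative sketch: the prior is the one given in the text, the two partitions are those implied by the conditional probabilities reported for each experiment, and the helper names `prob` and `posterior` are not from the original.

```python
from fractions import Fraction

# Shared prior over the seven states, in 32nds, as given in the text.
prior = {s: Fraction(n, 32) for s, n in zip("abcdefg", [4, 2, 8, 5, 7, 2, 4])}

E = set("acde")  # the proposition of interest

# Partitions generated by the two experiments.
experiment_1 = [set("abc"), set("def"), set("g")]
experiment_2 = [set("abd"), set("cefg")]


def prob(event):
    """Prior probability of an event (a set of states)."""
    return sum((prior[s] for s in event), Fraction(0))


def posterior(event, cell):
    """Probability of `event` conditional on learning `cell`."""
    return prob(event & cell) / prob(cell)


for name, partition in [("Experiment 1", experiment_1), ("Experiment 2", experiment_2)]:
    for cell in partition:
        print(name, sorted(cell), posterior(E, cell))
```

Note that `Fraction` prints values in lowest terms, so 12/14 appears as 6/7 and 15/21 as 5/7.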

Suppose now that each scientist goes to his or her laboratory and performs the
respective experiment (Scientist 1 experiment 1 and Scientist 2 experiment 2).
Assume also that the true state is f. Suppose that the two scientists send
each other an e-mail communicating their new subjective estimates of the truth
of E. Scientist 1 writes that he now attaches probability 12/14 to E, while Scientist
2 says that she attaches probability 15/21 to E. So their estimates disagree (not
surprisingly, since they have performed different experiments and have
collected different information). Should they be happy with these estimates?
The answer is negative.

Consider first S1 (Scientist 1). He hears that S2 has a revised probability of 15/21
(recall our assumption that the true state is f). What does he learn from this?
He learns that the true state cannot be d (if it had been, then he would have
received an e-mail from S2 saying “my new probability is 9/11”). S1’s new state of
knowledge and corresponding probabilities after receiving S2’s e-mail are then
as shown in Figure 8.11.

If g is the true state, then experiment 1 reveals it.

If either d, e or f is the true state then experiment 1 reveals {d,e,f}. With e or f
S2’s e-mail is “15/21”, with d it is “9/11”. Thus with e or f one can narrow down
to {e,f} while with d one learns that the true state is d.

If the true state is a, b or c then experiment 1 reveals {a,b,c}. With a or b S2
sends “9/11” while with c she sends “15/21”. Thus the e-mail splits {a,b,c} into
{a,b} and {c}.

Figure 8.11


Consider now Scientist 2 (recall that the true state is assumed to be f). From
S1’s e-mail she learns that S1 has a new updated probability of 12/14. What can
she deduce from this? That the true state is not g (if it had been g then she
would have received an e-mail from S1 saying that his revised probability of E
was zero). Thus she can revise her knowledge partition by eliminating g from
her information set. A similar reasoning applies to the other states. S2’s new
state of knowledge and corresponding probabilities after receiving S1’s e-mail
are shown in Figure 8.12.


Experiment 1:  {a,b,c}: Prob(E) = 12/14   {d,e,f}: Prob(E) = 12/14   {g}: Prob(E) = 0

What Scientist 2 learns from S1’s first e-mail:  {a,b,d}: Prob(E) = 9/11   {c,e,f}: Prob(E) = 15/17   {g}: Prob(E) = 0

If either c, e, f or g is the true state then experiment 2 reveals {c,e,f,g}. With
c, e or f S1’s e-mail is “12/14”, with g it is “0”. Thus with c, e or f one can
narrow down to {c,e,f} while with g one learns that the true state is g.

If the true state is a, b or d then experiment 2 reveals {a,b,d}. In all three cases
S1’s e-mail is “12/14” and therefore nothing is learned.

Figure 8.12


Thus the new situation after the first exchange of e-mails is as shown in Figure
8.13 below.

Scientist 1:  {a,b}: Prob(E) = 4/6   {c}: Prob(E) = 1   {d}: Prob(E) = 1   {e,f}: Prob(E) = 7/9   {g}: Prob(E) = 0

Scientist 2:  {a,b,d}: Prob(E) = 9/11   {c,e,f}: Prob(E) = 15/17   {g}: Prob(E) = 0

(The true state is f.)

Figure 8.13

Now there is a second round of e-mails. S1 communicates “in light of your
e-mail, my new P(E) is 7/9”, while S2 writes “after your e-mail I changed my P(E)
to 15/17”. While S1 learns nothing new from S2’s second e-mail, S2 learns that
the true state cannot be c (the second e-mail would have been “P(E) = 1” if c
had been the true state; in the hypothetical case where S2’s revised information
was {a,b,d} then after S1’s second e-mail it would split into {a,b} and {d}). Thus
the new situation is as shown in Figure 8.14.

Scientist 1:  {a,b}: Prob(E) = 4/6   {c}: Prob(E) = 1   {d}: Prob(E) = 1   {e,f}: Prob(E) = 7/9   {g}: Prob(E) = 0

Scientist 2:  {a,b}: Prob(E) = 4/6   {c}: Prob(E) = 1   {d}: Prob(E) = 1   {e,f}: Prob(E) = 7/9   {g}: Prob(E) = 0

(The true state is f.)

Figure 8.14

Now they have reached complete agreement: P(E) = 7/9. Further exchanges do
not convey any new information. Indeed it has become common knowledge (at
state f) that both scientists estimate the probability of event E to be 7/9 ≈ 78%
(before the experiments the probability of E was judged to be 24/32 = 75%; note,
again, that with the experiments and the exchange of information they have
gone farther from the truth than at the beginning, since E is false at state f!).

Notice  that  before  the  last  step  it  was  never  common  knowledge  between  the 
two  what  probability  each  scientist  attached  to  E.  When  one  scientist 
announced  his  subjective  estimate,  the  other  scientist  found  that  announcement 
informative  and  revised  her  own  estimate  accordingly.  After  the  exchange  of 
two  e-mails,  the  further  announcement  by  one  scientist  of  his/her  estimate  of 
the  probability  of  E would  not  make  the  other  scientist  change  his/her  own 
estimate:  the  announcement  would  reveal  nothing  new. 
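The whole exchange of e-mails can be simulated. The sketch below is an illustration only, under the assumption that in each round both scientists simultaneously announce their current probability of E and then each splits his or her information sets according to what the other would have announced at each state; the function names `announcements`, `refine` and `exchange` are hypothetical.

```python
from fractions import Fraction

prior = {s: Fraction(n, 32) for s, n in zip("abcdefg", [4, 2, 8, 5, 7, 2, 4])}
E = set("acde")


def posterior(cell):
    """Probability of E conditional on the information set `cell`."""
    total = sum((prior[s] for s in cell), Fraction(0))
    return sum((prior[s] for s in cell & E), Fraction(0)) / total


def announcements(partition):
    """Map each state to the probability of E announced at that state."""
    return {s: posterior(cell) for cell in partition for s in cell}


def refine(partition, heard):
    """Split every cell according to the announcement heard at each state."""
    refined = []
    for cell in partition:
        groups = {}
        for s in cell:
            groups.setdefault(heard[s], set()).add(s)
        refined.extend(groups.values())
    return refined


def exchange(p1, p2, true_state):
    """Repeat rounds of announcements until neither partition changes.
    Returns the pair of announcements made at `true_state` in each round."""
    history = []
    while True:
        a1, a2 = announcements(p1), announcements(p2)
        history.append((a1[true_state], a2[true_state]))
        n1, n2 = refine(p1, a2), refine(p2, a1)
        # Refinement can only split cells, so equal counts mean no change.
        if len(n1) == len(p1) and len(n2) == len(p2):
            return history
        p1, p2 = n1, n2


history = exchange(
    [set("abc"), set("def"), set("g")],   # experiment 1
    [set("abd"), set("cefg")],            # experiment 2
    "f",
)
for round_number, (x1, x2) in enumerate(history, start=1):
    print(round_number, x1, x2)
```

With true state f the announcements are (12/14, 15/21), then (7/9, 15/17), then (7/9, 7/9), at which point nothing further is learned, in line with Figures 8.13 and 8.14.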

This is a good time to test your understanding of the concepts introduced in
this section, by going through the exercises in Section 8.E.5 of Appendix 8.E at the
end of this chapter.

Appendix 8.A: Proof of the Agreement Theorem

First we prove the following.

Lemma. Let W be a finite set of states, P a probability distribution on W and
$E, F \subseteq W$ two events. Let $\{F_1, \dots, F_m\}$ be a partition of F (thus $F = F_1 \cup \dots \cup F_m$
and any two $F_j$ and $F_k$ with $j \neq k$ are non-empty and disjoint). Suppose that
$P(E \mid F_j) = q$ for all $j = 1, \dots, m$. Then $P(E \mid F) = q$.

Proof. By definition of conditional probability, $P(E \mid F_j) = \frac{P(E \cap F_j)}{P(F_j)}$. Hence,
since $P(E \mid F_j) = q$, we have that $P(E \cap F_j) = q\,P(F_j)$. Adding over j, the
left-hand side becomes $P(E \cap F)$ [because $E \cap F = (E \cap F_1) \cup \dots \cup (E \cap F_m)$ and
for any j and k with $j \neq k$, $(E \cap F_j) \cap (E \cap F_k) = \emptyset$, so that
$P(E \cap F) = P(E \cap F_1) + \dots + P(E \cap F_m)$] and the right-hand side becomes $q\,P(F)$. Hence

$P(E \mid F) = \frac{P(E \cap F)}{P(F)} = \frac{q\,P(F)}{P(F)} = q.$ ■

Proof of Theorem 8.1. Suppose that $CK\left(\|P_1(E) = p\| \cap \|P_2(E) = q\|\right) \neq \emptyset$. Let P
be a common prior. Select an arbitrary $w \in CK\left(\|P_1(E) = p\| \cap \|P_2(E) = q\|\right)$ and
let $I_{CK}(w)$ be the cell of the common knowledge partition containing w.
Consider Individual 1. $I_{CK}(w)$ is equal to the union of a collection of cells
(information sets) of 1’s information partition. On each such cell 1’s
conditional probability of E, using the common prior P, is p. Hence, by the
above lemma, $P(E \mid I_{CK}(w)) = p$. A similar reasoning for Individual 2 leads to
$P(E \mid I_{CK}(w)) = q$. Hence p = q. ■
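As a numerical illustration of the lemma (not a proof), the short Python sketch below reuses the prior and the event E of the two-scientists example of Section 8.5. It assumes the cells $F_1 = \{a,b,c\}$ and $F_2 = \{d,e,f\}$, on both of which the conditional probability of E equals 12/14; the helper names `prob` and `cond` are illustrative.

```python
from fractions import Fraction

prior = {s: Fraction(n, 32) for s, n in zip("abcdefg", [4, 2, 8, 5, 7, 2, 4])}
E = set("acde")


def prob(event):
    """Probability of an event under the prior."""
    return sum((prior[s] for s in event), Fraction(0))


def cond(event, given):
    """Conditional probability P(event | given)."""
    return prob(event & given) / prob(given)


cells = [set("abc"), set("def")]   # a partition of F
F = set().union(*cells)

q = cond(E, cells[0])
same_on_every_cell = all(cond(E, cell) == q for cell in cells)
print(same_on_every_cell, cond(E, F) == q)
```

Both checks print True: since P(E | F_j) equals q on every cell, conditioning on the union F gives the same value q.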
Appendix 8.E: Exercises

8.E.1. Exercises for Section 8.1: Probabilistic beliefs

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 

Exercise  8.1.  Let  the  set  of  states  be  W = {a,b,c,d,e,f,g,h,i}.  Amy’s  initial 
beliefs  are  given  by  the  following  probability  distribution: 

state  a  b  c  d  e  f  g  h  i
probability

(a) Let E = {a,f,g,h,i}. What is the probability of E?

(b)  Find  all  the  events  that  Amy  is  certain  of. 

Exercise 8.2. Let W be a finite set of states and let A and B be two events.
Recall that $A \cup B$ is the union of A and B, that is, the set of states that belong
to either A or B (or both) and $A \cap B$ is the intersection of A and B, that is, the
set of states that belong to both A and B. Explain why
$P(A \cup B) = P(A) + P(B) - P(A \cap B)$.

8.E.2.  Exercises  for  Section  8.2: 

Conditional  probability,  belief  updating,  Bayes’  rule 

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 

Exercise  8.3.  In  a remote  rural  clinic  with  limited  resources,  a patient  arrives 
complaining  of  low-abdomen  pain.  Based  on  all  the  information  available,  the 
doctor  thinks  that  there  are  only  four  possible  causes:  a bacterial  infection  (b),  a 
viral  infection  (v),  cancer  (c),  internal  bleeding  (i).  Of  the  four,  only  the 
bacterial  infection  and  internal  bleeding  are  treatable  at  the  clinic.  In  the  past 
the doctor has seen 600 similar cases and they eventually turned out to be as
follows:

Bacterial infection (b)   Viral infection (v)   Cancer (c)   Internal bleeding (i)
140                       110                   90           260

Her probabilistic estimates are based on those past cases.

(a)  What  is  the  probability  that  the  patient  has  a treatable  disease? 

There  are  two  possible  ways  of  gathering  more  information:  a blood  test  and 
an  ultrasound.  A positive  blood  test  will  reveal  that  there  is  an  infection, 
however  it  could  be  either  a bacterial  or  a viral  infection;  a negative  blood  test 
rules  out  an  infection  (either  bacterial  or  viral)  and  thus  leaves  cancer  and 
internal  bleeding  as  the  only  possibilities.  The  ultrasound,  on  the  other  hand, 
will  reveal  if  there  is  internal  bleeding. 

(b)  Suppose  that  the  patient  gets  an  ultrasound  and  it  turns  out  that  there  is  no 
internal  bleeding.  What  is  the  probability  that  he  does  not  have  a treatable 
disease?  What  is  the  probability  that  he  has  cancer? 

(c)  If  instead  of  getting  the  ultrasound  he  had  taken  the  blood  test  and  it  had 
been  positive,  what  would  have  been  the  probability  that  he  had  a treatable 
disease? 

(d)  Now  let  us  go  back  to  the  hypothesis  that  the  patient  only  gets  the 
ultrasound  and  it  turns  out  that  there  is  no  internal  bleeding.  He  then  asks 
the  doctor:  “if  I were  to  take  the  blood  test  too,  how  likely  is  it  that  it 
would  be  positive?”.  What  should  the  doctor’s  answer  be? 

(e) Finally, suppose that the patient gets both the ultrasound and the blood test
and the ultrasound reveals that there is no internal bleeding, while the
blood test is positive. How likely is it that he has a treatable disease?

Exercise 8.4. A and B are two events such that P(A) = 0.2, P(B) = 0.5 and
P(B | A) = 0.1. Calculate P(A | B) using Bayes’ rule.


Exercise  8.5.  A lab  technician  was  asked  to  mark  some  specimens  with  two 
letters,  one  from  the  set  {A,  B,  C}  and  the  other  from  the  set  {E,F,G}.  For 
example,  a specimen  could  be  labeled  as  AE  or  BG,  etc.  He  had  a total  of  220 
specimens.  He  has  to  file  a report  to  his  boss,  consisting  of  the  following  table. 


LABEL   AE  AF  AG  BE  BF  BG  CE  CF  CG
number

Unfortunately,  he  does  not  remember  all  the  figures.  He  had  written  some 
notes  to  himself,  which  are  reproduced  below.  Help  him  fill  in  the  above  table 
with  the  help  of  his  notes  and  the  conditional  probability  formula.  Here  are  the 
technician’s  notes: 

(a) Of all the ones that he marked with an E, one fifth were also marked with
an A and one fifth were marked with a B.

(b) He marked 36 specimens with the label CE.

(c) Of all the specimens that he marked with a C, 12/23 were marked with a G.

(d) Of all the specimens, 23/55 were marked with a C.

(e) The number of specimens marked BG was twice the number of specimens
marked BE.

(f) Of all the specimens marked with an A, 3/20 were marked with an E.

(g) Of all the specimens marked with an A, one tenth were marked with a G.

8.E.3. Exercises for Section 8.3: Belief revision

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 

Exercise 8.6. Prove that an AGM belief revision function satisfies Arrow’s
Axiom (Remark 8.2).

Exercise 8.7. Let W = {a,b,c,d,e,g,h,k,m} and let $\precsim$ be the following plausibility
order on W (we use the convention that if the row to which state w belongs is
above the row to which state w' belongs then $w \prec w'$, and if w and w' belong
to the same row then $w \sim w'$).

most plausible   b, g
                 c, k, m
                 d, h
                 e
least plausible  a

Let $\mathcal{E} = \{\{a,e\}, \{d,e,k,m\}, \{b,d,e,k\}, W\}$. Find the belief revision function
$f : \mathcal{E} \to 2^W$ that is rationalized by $\precsim$.

Exercise 8.8. As in Exercise 8.7, let W = {a,b,c,d,e,g,h,k,m} and
$\mathcal{E} = \{E, F, G, W\}$ with E = {a,e}, F = {d,e,k,m} and G = {b,d,e,k}. Using the
plausibility order of Exercise 8.7, find a collection of probability distributions
$\{P_E, P_F, P_G, P_W\}$ that provides an AGM probabilistic belief revision policy
(Definition 8.9). [There are many: find one.]

8.E.4. Exercises for Section 8.4: Harsanyi consistency of beliefs

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 

Exercise  8.9.  Consider  the  following  knowledge-belief  structure: 


Player  2: 

Are  the  individuals’  beliefs  Harsanyi  consistent? 

Exercise  8.10.  Consider  the  following  knowledge-belief  structure: 


(a) Show that if $p = \tfrac{1}{13}$ then the beliefs are Harsanyi consistent.

(b) Show that if $p \neq \tfrac{1}{13}$ then the beliefs are not Harsanyi consistent.

8.E.5. Exercises for Section 8.5: Agreeing to disagree

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 


Exercise  8.11.  Consider  the  following  knowledge-belief  structure: 


1: 

2: 


gf 


(a)  Find  the  common  knowledge  partition. 

(b)  Find  a common  prior. 

(c) Let E = {b,d,f}. (c.1) Is the probability that Individual 1 assigns to E
common knowledge? (c.2) Is the probability that Individual 2 assigns to
E common knowledge?

(d) Let E = {b,d,f}. (d.1) At state b does Individual 1 know what
probability Individual 2 assigns to E? (d.2) At state c does Individual 1
know what probability Individual 2 assigns to E? (d.3) At state f does
Individual 1 know what probability Individual 2 assigns to E?

Exercise  8.12.  Challenging  Question. 

[This  is  known  as  the  Monty  Hall  problem.] 

You  are  a contestant  in  a show.  You  are  shown  three  doors,  numbered  1,  2 and 
3.  Behind  one  of  them  is  a new  car,  which  will  be  yours  if  you  choose  to  open 
that  door.  The  door  behind  which  the  car  was  placed  was  chosen  randomly 
with  equal  probability  (a  die  was  thrown,  if  it  came  up  1 or  2 then  the  car  was 
placed  behind  door  1,  if  it  came  up  3 or  4 then  the  car  was  placed  behind  door  2 
and  if  it  came  up  5 or  6 then  the  car  was  placed  behind  door  3).  You  have  to 
choose  one  door.  Suppose  that  you  have  chosen  door  number  1.  Before  the 
door  is  opened  the  show  host  tells  you  that  he  knows  where  the  car  is  and,  to 
help  you,  he  will  open  one  of  the  other  two  doors,  making  sure  that  he  opens  a 
door  behind  which  there  is  no  car;  if  there  are  two  such  doors,  then  he  will 
choose randomly with equal probability. Afterwards he will give you a chance
to change your mind and switch to the other closed door, but you will have to pay
$50  if  you  decide  to  switch.  Suppose  that  (initially  you  chose  door  1 and)  the 
host  opens  door  3 to  show  you  that  the  car  is  not  there.  Should  you  switch 
from door 1 to door 2? [Assume that, if switching increases the probability of
getting the car, then you find it worthwhile to pay $50 to switch.]


Let  us  approach  this  problem  in  two  different  ways. 

(a)  Method  1.  Draw  (part  of)  an  extensive  form  with  imperfect  information  where 
Nature  moves  first  and  chooses  where  the  car  is,  then  you  choose  one  door 
and  then  the  host  chooses  which  door  to  open  (of  course,  the  host’s  choice  is 
made  according  to  the  rules  specified  above).  Reasoning  about  the  information 
set  you  are  in  after  you  have  pointed  to  door  1 and  the  host  has  opened  door  3, 
determine  if  you  should  switch  from  door  1 to  door  2. 

(b) Method 2. Let $D_n$ denote the event that the car is behind door
$n \in \{1,2,3\}$. Let $O_n$ denote the event that the host opens door n. The prior
probabilities are $P(D_1) = P(D_2) = P(D_3) = \tfrac{1}{3}$. Compute $P(D_1 \mid O_3)$ using Bayes’
rule (if $P(D_1 \mid O_3) = \tfrac{1}{2}$ then you should not switch, since there is a cost in
switching).

Appendix 8.S: Solutions to exercises


Exercise 8.1. The probability distribution is

state  a  b  c  d  e  f  g  h  i
probability

(a) Let E = {a,f,g,h,i}. Then
$P(E) = P(a) + P(f) + P(g) + P(h) + P(i) = \tfrac{28}{60} = \tfrac{7}{15}$.

(b) Amy is certain of all events that have probability 1, namely
{a,c,d,e,f,g,h}, {a,b,c,d,e,f,g,h}, {a,c,d,e,f,g,h,i} and
{a,b,c,d,e,f,g,h,i}.


Exercise 8.2. Since $P(A) = \sum_{w \in A} P(w)$ and $P(B) = \sum_{w \in B} P(w)$, when adding P(A) to
P(B) the elements that belong to both A and B (that is, the elements of $A \cap B$)
are added twice and thus we need to subtract $\sum_{w \in A \cap B} P(w)$ from P(A) + P(B) in
order to get $\sum_{w \in A \cup B} P(w)$, which is equal to $P(A \cup B)$.


Exercise 8.3. The probabilities are as follows:

state        b                v                 c               i
probability  140/600 = 7/30   110/600 = 11/60   90/600 = 3/20   260/600 = 13/30

(a) The event that the patient has a treatable disease is {b,i}.
$P(\{b,i\}) = P(b) + P(i) = \tfrac{7}{30} + \tfrac{13}{30} = \tfrac{2}{3}$.

(b) A negative result of the ultrasound is represented by the event {b,v,c}. A
non-treatable disease is the event {v,c}. Thus

$P(\{v,c\} \mid \{b,v,c\}) = \frac{P(\{v,c\} \cap \{b,v,c\})}{P(\{b,v,c\})} = \frac{P(\{v,c\})}{P(\{b,v,c\})} = \frac{\tfrac{11}{60} + \tfrac{3}{20}}{\tfrac{7}{30} + \tfrac{11}{60} + \tfrac{3}{20}} = \frac{10}{17}$

and

$P(c \mid \{b,v,c\}) = \frac{P(c)}{P(\{b,v,c\})} = \frac{\tfrac{3}{20}}{\tfrac{7}{30} + \tfrac{11}{60} + \tfrac{3}{20}} = \frac{9}{34}.$

(c) A positive blood test is represented by the event {b,v}. A treatable disease is
the event {b,i}. Thus

$P(\{b,i\} \mid \{b,v\}) = \frac{P(\{b,i\} \cap \{b,v\})}{P(\{b,v\})} = \frac{P(b)}{P(\{b,v\})} = \frac{\tfrac{7}{30}}{\tfrac{7}{30} + \tfrac{11}{60}} = \frac{14}{25}.$

(d) Here we want $P(\{b,v\} \mid \{b,v,c\}) = \frac{P(\{b,v\})}{P(\{b,v,c\})} = \frac{\tfrac{7}{30} + \tfrac{11}{60}}{\tfrac{7}{30} + \tfrac{11}{60} + \tfrac{3}{20}} = \frac{25}{34}$.

(e) The information is $\{b,v\} \cap \{b,v,c\} = \{b,v\}$. Thus we want $P(\{b,i\} \mid \{b,v\})$
which was calculated in Part (c) as $\tfrac{14}{25}$.
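The answers to Exercise 8.3 can be cross-checked with a short Python sketch. It only uses the case counts given in the exercise; the helper names `prob` and `cond` and the keys of `answers` are illustrative, not part of the original solution.

```python
from fractions import Fraction

# Past cases: bacterial (b), viral (v), cancer (c), internal bleeding (i).
cases = {"b": 140, "v": 110, "c": 90, "i": 260}
total = sum(cases.values())


def prob(event):
    """Probability of an event, estimated from the past cases."""
    return Fraction(sum(cases[s] for s in event), total)


def cond(event, given):
    """Conditional probability P(event | given)."""
    return prob(event & given) / prob(given)


treatable = {"b", "i"}
no_bleeding = {"b", "v", "c"}      # negative ultrasound
positive_blood = {"b", "v"}        # positive blood test

answers = {
    "a": prob(treatable),
    "b_not_treatable": cond({"v", "c"}, no_bleeding),
    "b_cancer": cond({"c"}, no_bleeding),
    "c": cond(treatable, positive_blood),
    "d": cond(positive_blood, no_bleeding),
    "e": cond(treatable, positive_blood & no_bleeding),
}
for part, value in answers.items():
    print(part, value)
```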


Exercise 8.4. $P(A \mid B) = \frac{P(B \mid A)\,P(A)}{P(B)} = \frac{(0.1)(0.2)}{0.5} = 0.04$.


Exercise 8.5. Let P(xy) be the fraction of the total number of specimens that
were marked xy and P(x | y) be the number of specimens marked with an x as a
fraction of the specimens that were marked with a y (this is like a conditional
probability). This is how we can re-write the information contained in the
technician’s notes:

(a) $P(A \mid E) = P(B \mid E) = \tfrac{1}{5}$. It follows that the remaining three fifths were
marked with a C, that is, $P(C \mid E) = \tfrac{3}{5}$.

(b) The number of CEs is 36. Thus $P(CE) = \tfrac{36}{220}$. Since $P(C \mid E) = \frac{P(CE)}{P(E)}$,
using (a) we get $\tfrac{3}{5} = \frac{36/220}{P(E)}$, that is, $P(E) = \tfrac{36}{220} \cdot \tfrac{5}{3} = \tfrac{3}{11}$. Hence the number of
specimens marked with an E is $\tfrac{3}{11} \cdot 220 = 60$. Furthermore,
$P(A \mid E) = \frac{P(AE)}{P(E)}$. Thus, using (a), $\tfrac{1}{5} = \frac{P(AE)}{3/11}$, that is, $P(AE) = \tfrac{3}{55}$. Thus
the number of specimens marked AE is $\tfrac{3}{55} \cdot 220 = 12$. The calculation for
P(B | E) is identical; thus the number of specimens marked BE is also 12.
Thus, so far, we have:

LABEL   AE  AF  AG  BE  BF  BG  CE  CF  CG
number  12          12          36

(c) $P(G \mid C) = \tfrac{12}{23}$. Thus, since $P(G \mid C) = \frac{P(CG)}{P(C)}$, we have $\tfrac{12}{23} = \frac{P(CG)}{P(C)}$.

(d) $P(C) = \tfrac{23}{55}$. Thus, using (c) we get $\tfrac{12}{23} = \frac{P(CG)}{23/55}$, that is, $P(CG) = \tfrac{12}{55}$. Hence
the number of specimens marked CG is $\tfrac{12}{55} \cdot 220 = 48$. Since $P(C) = \tfrac{23}{55}$, the
total number of specimens marked with a C is $\tfrac{23}{55} \cdot 220 = 92$. Since 36 were
marked CE and 48 were marked CG, it follows that the number of
specimens marked CF is 92 - 48 - 36 = 8. Thus, so far, we have:

LABEL   AE  AF  AG  BE  BF  BG  CE  CF  CG
number  12          12          36  8   48

(e) The number of BGs is twice the number of BEs. Since the latter is 12, the
number of BGs is 24.




(f) $P(E \mid A) = \tfrac{3}{20}$. Since $P(E \mid A) = \frac{P(AE)}{P(A)}$ and, from (b), $P(AE) = \tfrac{3}{55}$, we have
that $\tfrac{3}{20} = \frac{3/55}{P(A)}$. Hence $P(A) = \tfrac{3}{55} \cdot \tfrac{20}{3} = \tfrac{4}{11}$. Thus the number of specimens
marked with an A is $\tfrac{4}{11} \cdot 220 = 80$. Since $P(A) = \tfrac{4}{11}$ and (from (d)) $P(C) = \tfrac{23}{55}$,
it follows that $P(B) = 1 - \tfrac{4}{11} - \tfrac{23}{55} = \tfrac{12}{55}$. Thus the number of specimens
marked with a B is $\tfrac{12}{55} \cdot 220 = 48$. Of these, 12 were marked BE and 24 were
marked BG. Thus the number of specimens marked BF is 48 - 12 - 24 = 12.
Thus, so far, we have:

LABEL   AE  AF  AG  BE  BF  BG  CE  CF  CG
number  12          12  12  24  36  8   48

(g) $P(G \mid A) = \tfrac{1}{10}$. Since $P(G \mid A) = \frac{P(AG)}{P(A)}$ and, from (f), $P(A) = \tfrac{4}{11}$, we have
that $\tfrac{1}{10} = \frac{P(AG)}{4/11}$. Thus $P(AG) = \tfrac{1}{10} \cdot \tfrac{4}{11} = \tfrac{2}{55}$. Thus the number of specimens
marked AG is $\tfrac{2}{55} \cdot 220 = 8$. Since the number marked with an A is
$\tfrac{4}{11} \cdot 220 = 80$ and those marked AE are 12 and those marked AG are 8, we get
that the number of specimens marked AF is 80 - 12 - 8 = 60. Thus we have
completed the table:

LABEL   AE  AF  AG  BE  BF  BG  CE  CF  CG
number  12  60  8   12  12  24  36  8   48
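As a sanity check, the completed table can be tested against each of the technician's notes. The Python sketch below is illustrative only; the fractions used for notes (c), (d) and (f) are the ones implied by the counts in the solution (48 out of 92, 92 out of 220 and 12 out of 80), and the helper names `count` and `share` are not from the original.

```python
from fractions import Fraction

table = {"AE": 12, "AF": 60, "AG": 8,
         "BE": 12, "BF": 12, "BG": 24,
         "CE": 36, "CF": 8, "CG": 48}


def count(letter):
    """Number of specimens whose label contains `letter`."""
    return sum(n for label, n in table.items() if letter in label)


def share(label, letter):
    """Specimens marked `label` as a fraction of those marked with `letter`."""
    return Fraction(table[label], count(letter))


checks = {
    "total": sum(table.values()) == 220,
    "a": share("AE", "E") == Fraction(1, 5) and share("BE", "E") == Fraction(1, 5),
    "b": table["CE"] == 36,
    "c": share("CG", "C") == Fraction(12, 23),
    "d": Fraction(count("C"), 220) == Fraction(23, 55),
    "e": table["BG"] == 2 * table["BE"],
    "f": share("AE", "A") == Fraction(3, 20),
    "g": share("AG", "A") == Fraction(1, 10),
}
print(checks)
```

Every entry of `checks` evaluates to True for the completed table.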

Exercise 8.6. Let $f : \mathcal{E} \to 2^W$ be an AGM belief revision function. Let $E, F \in \mathcal{E}$
be such that $E \subseteq F$ and $E \cap f(F) \neq \emptyset$. We need to show that
$f(E) = E \cap f(F)$. By definition of AGM belief revision function,

$f(F) = \{w \in F : w \precsim w' \text{ for every } w' \in F\}$   (1)

and

$f(E) = \{w \in E : w \precsim w' \text{ for every } w' \in E\}$   (2)

Choose an arbitrary $w \in E \cap f(F)$. Then, by (1) and the fact that $E \subseteq F$,
$w \precsim w'$ for every $w' \in E$ and thus, by (2), $w \in f(E)$. Hence $E \cap f(F) \subseteq f(E)$.
Conversely, choose an arbitrary $w_1 \in f(E)$. Then, since $f(E) \subseteq E$, $w_1 \in E$. We
want to show that $w_1 \in f(F)$ [so that $w_1 \in E \cap f(F)$ and, therefore,
$f(E) \subseteq E \cap f(F)$]. Suppose not. Then, by (1), there exists a $w_2 \in F$ such that
$w_2 \prec w_1$. Select a $w_3 \in E \cap f(F)$ (recall that, by hypothesis, $E \cap f(F) \neq \emptyset$).
Then, by (1) (since $w_2 \in F$ and $w_3 \in f(F)$), $w_3 \precsim w_2$, from which it follows (by
transitivity of $\precsim$ and the fact that $w_2 \prec w_1$) that $w_3 \prec w_1$. But then, since
$w_3 \in E$, it is not true that $w_1 \precsim w'$ for every $w' \in E$, contradicting, by (2), the
hypothesis that $w_1 \in f(E)$.

Exercise 8.7. We have that $\mathcal{E} = \{\{a,e\}, \{d,e,k,m\}, \{b,d,e,k\}, W\}$ and the
plausibility order is

most plausible   b, g
                 c, k, m
                 d, h
                 e
least plausible  a

The belief revision function rationalized by this plausibility order is given by
f({a,e}) = {e}, f({d,e,k,m}) = {k,m}, f({b,d,e,k}) = {b} and f(W) = {b,g}.
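The rule used in Exercise 8.7 (keep the most plausible states within the information received) is easy to express in code. The following Python sketch is an illustration under the assumption that the plausibility order is encoded as a list of rows from most to least plausible; the names `ranking`, `rank` and `revise` are hypothetical.

```python
# Rows of the plausibility order, from most plausible to least plausible.
ranking = [{"b", "g"}, {"c", "k", "m"}, {"d", "h"}, {"e"}, {"a"}]

# rank[s] is the index of the row containing state s (0 = most plausible).
rank = {s: i for i, row in enumerate(ranking) for s in row}

W = set(rank)


def revise(event):
    """Return the most plausible states in `event`, i.e. the states w in
    `event` that are at least as plausible as every other state in it."""
    best = min(rank[s] for s in event)
    return {s for s in event if rank[s] == best}


for event in [{"a", "e"}, {"d", "e", "k", "m"}, {"b", "d", "e", "k"}, W]:
    print(sorted(event), "->", sorted(revise(event)))
```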

Exercise 8.8. From Exercise 8.7 we get that $\{P_E, P_F, P_G, P_W\}$ must be such that
$Supp(P_E) = \{e\}$, $Supp(P_F) = \{k,m\}$, $Supp(P_G) = \{b\}$ and $Supp(P_W) = \{b,g\}$. For
every full support probability distribution $P_0$ on W, there is a corresponding
collection $\{P_E, P_F, P_G, P_W\}$. For example, if $P_0$ is the uniform distribution on W
(that assigns probability $\tfrac{1}{9}$ to every state) then the corresponding
$\{P_E, P_F, P_G, P_W\}$ is given by:

state  a  b    c  d  e  g    h  k    m
P_E    0  0    0  0  1  0    0  0    0
P_F    0  0    0  0  0  0    0  1/2  1/2
P_G    0  1    0  0  0  0    0  0    0
P_W    0  1/2  0  0  0  1/2  0  0    0

If  P0  is  the  following  probability  distribution 

state  abcdeghk  m 

r0  50  50  50  50  50  50  50  50  50 

then  the  corresponding  { PE,PF,PG,PW}  is  given  by:  PE  and  PG  the  same  as 
above,  and  PF  and  Pw  as  follows: 

state  abcdeghk  m 
PF  00000001} 
pw  0 i 0 0 0 } 0 0 0 
Exercise 8.10. (a) Assume that $p = \tfrac{1}{13}$ (so that $1 - p = \tfrac{12}{13}$). From the updating
conditions for Individual 1 we get $p_b = 2p_a$ and $p_e = p_f$; from the updating
conditions for Individual 2 we get $p_c = 3p_b$ and $p_e = 4p_d$; from the updating
conditions for Individual 3 we get $p_c = p_d$ and $p_f = 12p_b$. All these equations
have a solution, which constitutes a common prior, namely

state  a     b     c     d     e      f
prior  1/63  2/63  6/63  6/63  24/63  24/63

(b) From the equations of part (a) we have that $p_f = p_e$, $p_e = 4p_d$, $p_d = p_c$
and $p_c = 3p_b$, from which it follows that $p_f = 12p_b$. Thus, updating on the
information set {b,f} of Individual 3 we get that we must have
$1 - p = \frac{p_f}{p_b + p_f} = \frac{12p_b}{p_b + 12p_b} = \tfrac{12}{13}$ and thus $p = \tfrac{1}{13}$. Hence, if $p \neq \tfrac{1}{13}$ there is no
common prior.


Exercise  8.11.  (a)  and  (b)  The  common  knowledge  partition  and  the  common 
prior  are  shown  below. 


CK  partition 
and  common 
prior 

(c.l)  Individual  1 assigns  probability  } to  E at  every  state.  Thus  his  assessment 
of  the  probability  of  E is  common  knowledge  at  every  state. 
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(c.2)  At  state  a Individual  2 assigns  probability  \ to  E,  while  at  state  b she 
assigns  probability  f to  E.  Thus  her  assessment  of  the  probability  of  E is 
not  common  knowledge  at  any  state. 

(d.1) At state a Individual 1’s information set is {a,b}; since at a Individual 2
assigns  probability  \ to  E,  while  at  b she  assigns  probability  f-  to  E,  it 
follows  that  at  a it  is  not  the  case  that  Individual  1 knows  Individual  2’s 
assessment  of  the  probability  of  E. 

(d.2) At state c Individual 1’s information set is {c,d,e}; since at c Individual 2
assigns  probability  j-  to  E,  while  at  d she  assigns  probability  \ to  E,  it 
follows  that  at  c it  is  not  the  case  that  Individual  1 knows  Individual  2’s 
assessment  of  the  probability  of  E. 

(d.3) At state f Individual 1’s information set is {f,g}; since at f Individual 2
assigns  probability  \ to  E,  while  at  g she  assigns  probability  0 to  E,  it 
follows that at f it is not the case that Individual 1 knows Individual 2’s
assessment  of  the  probability  of  E. 

Exercise  8.12  (Challenging  question). 

(a) The extensive form is as follows (‘cbn’ means ‘the car is behind door n’, ‘chn’ means ‘you choose door n’).

[Figure: extensive-form game tree (root: NATURE)]

The hypothesized sequence of events leads to either node x or node y (you first choose door 1 and then the host opens door 3). The prior probability of getting to x is $\frac{1}{3} \times \frac{1}{2} = \frac{1}{6}$ while the prior probability of getting to y is $\frac{1}{3} \times 1 = \frac{1}{3}$. The information you have is $\{x,y\}$ and, using the conditional probability rule,

$$P(\{x\} \mid \{x,y\}) = \frac{P(\{x\} \cap \{x,y\})}{P(\{x,y\})} = \frac{P(\{x\})}{P(\{x,y\})} = \frac{\frac{1}{6}}{\frac{1}{6} + \frac{1}{3}} = \frac{1}{3}.$$

Thus you should switch.

(b) By Bayes’ rule, $P(D_1 \mid O_3) = \frac{P(O_3 \mid D_1)\,P(D_1)}{P(O_3)}$. We know that $P(D_1) = \frac{1}{3}$ and $P(O_3 \mid D_1) = \frac{1}{2}$ (when the car is behind door 1 then the host has a choice between opening door 2 and opening door 3 and he chooses with equal probability). Thus

$$P(D_1 \mid O_3) = \frac{\frac{1}{2} \times \frac{1}{3}}{P(O_3)} = \frac{\frac{1}{6}}{P(O_3)}. \qquad (♦)$$

Thus we only need to compute $P(O_3)$. By the law of total probability,

$$P(O_3) = P(O_3 \mid D_1)\,P(D_1) + P(O_3 \mid D_2)\,P(D_2) + P(O_3 \mid D_3)\,P(D_3)$$

$$= P(O_3 \mid D_1)\tfrac{1}{3} + P(O_3 \mid D_2)\tfrac{1}{3} + P(O_3 \mid D_3)\tfrac{1}{3} = \tfrac{1}{2} \times \tfrac{1}{3} + 1 \times \tfrac{1}{3} + 0 \times \tfrac{1}{3} = \tfrac{1}{6} + \tfrac{1}{3} = \tfrac{1}{2}$$

because $P(O_3 \mid D_1) = \frac{1}{2}$, $P(O_3 \mid D_2) = 1$ and $P(O_3 \mid D_3) = 0$. Substituting $\frac{1}{2}$ for $P(O_3)$ in (♦) we get that $P(D_1 \mid O_3) = \frac{1}{3}$. Hence the updated probability that the car is behind the other door (door 2) is $\frac{2}{3}$ and therefore you should switch.
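The Bayesian computation of part (b) can be checked with a short sketch that uses exact fractions. It assumes, as in the solution, that the car is equally likely to be behind each door and that the host picks with equal probability among the doors he is allowed to open; the function name and its parameters are illustrative only.

```python
from fractions import Fraction

def posterior_after_host_opens(chosen=1, opened=3):
    """Posterior probability of the car's location, given that you chose
    door `chosen` and the host then opened door `opened`."""
    doors = [1, 2, 3]
    prior = {d: Fraction(1, 3) for d in doors}

    def likelihood(car):
        # The host may open any door that is neither your choice nor the car.
        options = [d for d in doors if d != chosen and d != car]
        return Fraction(1, len(options)) if opened in options else Fraction(0)

    joint = {d: prior[d] * likelihood(d) for d in doors}
    total = sum(joint.values())              # this is P(O_opened)
    return {d: joint[d] / total for d in doors}

posterior = posterior_after_host_opens()
```

With the default arguments, `posterior[1]` is the probability of winning by staying and `posterior[2]` the probability of winning by switching.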




GAME  THEORY  - Giacomo  Bonanno 


Chapter 9


Common  Knowledge  of  Rationality 


9.1  Models  of  strategic-form  games 

In  Chapter  1 (Section  1.5)  and  Chapter  5 (Section  5.4)  we  discussed  the 
procedure  of  iterative  elimination  of  strictly  dominated  strategies  and 
claimed  that  it  captures  the  notion  of  common  knowledge  of  rationality  in 
strategic-form  games.  We  now  have  the  tools  to  state  this  precisely. 

The  epistemic  foundation  program  in  game  theory  aims  to  identify,  for  every 
game,  the  strategies  that  might  be  chosen  by  rational  and  intelligent  players 
who  know  the  structure  of  the  game  and  the  preferences  of  their  opponents 
and  who  recognize  each  other’s  rationality.  The  two  central  questions  are  thus: 
(1)  Under  what  circumstances  is  a player  rational?  and  (2)  What  does  ‘mutual 
recognition  of  rationality’  mean?  A natural  interpretation  of  the  latter  notion  is 
in  terms  of  ‘common  knowledge  of  rationality’.17  Thus  we  only  need  to  define 
what  it  means  for  a player  to  be  rational.  Intuitively,  a player  is  rational  if  she 
chooses  an  action  which  is  “best”  given  what  she  believes.  In  order  to  make  this 
more  precise  we  need  to  introduce  the  notion  of  a model  of  a game.  We  shall 
focus  on  strategic-form  games  with  cardinal  payoffs  (Definition  5.2,  Chapter  5). 


17  As  pointed  out  in  Chapter  7 (Remark  7.1),  a defining  characteristic  of  knowledge  is  truth:  if, 
at  a state,  a player  knows  event  E then,  at  that  state,  E must  be  true.  A more  general  notion 
is  that  of  belief,  which  allows  for  the  possibility  of  mistakes:  believing  E is  compatible  with 
E being  false.  Thus  a more  appealing  notion  is  that  of  common  belief  of  rationality;  however, 
in  order  to  simplify  the  exposition,  we  shall  restrict  attention  to  the  notions  of  knowledge 
and  common  knowledge  developed  in  Chapter  7.  For  the  analysis  of  common  belief  and  an 
overview  of  the  epistemic  foundation  program  in  game  theory  the  reader  is  referred  to 
Battigalli  and  Bonanno  (1999). 
The definition of a strategic-form game specifies the choices available to the
players  and  what  motivates  those  choices  (their  preferences  over  the  possible 
outcomes);  however,  it  leaves  out  an  important  factor  in  the  determination  of 
players’  choices,  namely  what  they  believe  about  the  other  players.  Adding  a 
specification  of  the  players’  knowledge  and  beliefs  determines  the  context  in 
which  a particular  game  is  played;  this  can  be  done  with  the  help  of  an 
interactive  knowledge-belief  structure. 

Recall that an interactive knowledge-belief structure consists of a set of states $W$, $n$ partitions $\mathcal{I}_1, \mathcal{I}_2, \ldots, \mathcal{I}_n$ of $W$ and a collection of probability distributions on $W$, one for each information set of each partition, whose support is a subset of that information set. The probability distributions encode the beliefs of the players in each possible state of knowledge. Recall also that if $w$ is a state we denote by $I_i(w)$ the element of the partition $\mathcal{I}_i$ that contains $w$, that is, the information set of Player $i$ at state $w$, and by $P_{i,w}$ the probability distribution on $I_i(w)$, representing the beliefs of Player $i$ at state $w$.

Definition 9.1. Let $G$ be an $n$-player strategic-form game with cardinal payoffs. A model of $G$ is an interactive knowledge-belief structure together with a function $\sigma_i : W \to S_i$ (for every $i \in \{1,\ldots,n\}$) that associates with every state a pure strategy of Player $i$ (recall that $S_i$ denotes the set of pure strategies of Player $i$).18 The interpretation of $s_i = \sigma_i(w)$ is that, at state $w$, Player $i$ plays (or has chosen) strategy $s_i$. We impose the restriction that a player always knows what strategy she has chosen, that is, the function $\sigma_i$ is constant on each information set of Player $i$: if $w' \in I_i(w)$ then $\sigma_i(w') = \sigma_i(w)$.

The addition of the functions $\sigma_i$ to an interactive knowledge-belief structure
yields  an  interpretation  of  events  in  terms  of  propositions  about  what  strategies 
the  players  choose,  thereby  giving  content  to  players’  knowledge  and  beliefs. 
Figure  9.1  shows  a strategic-form  game  and  a model  of  it. 


18 In Chapter 5 we used the symbol $\sigma_i$ to denote a mixed strategy of Player $i$. Mixed strategies will play no role in this chapter and thus there will be no possibility of confusion.

Figure 9.1


State $\beta$ in Figure 9.1 describes the following situation.

• Player 1 plays B ($\sigma_1(\beta) = B$) and Player 2 plays L ($\sigma_2(\beta) = L$).

• Player 1 (whose information set is $\{\alpha,\beta\}$) is uncertain as to whether Player 2 has chosen to play C ($\sigma_2(\alpha) = C$) or L ($\sigma_2(\beta) = L$); furthermore, he attaches probability $\frac{1}{2}$ to each of these two possibilities.

• Player 2 (whose information set is $\{\beta,\gamma\}$) is uncertain as to whether Player 1 has chosen to play B ($\sigma_1(\beta) = B$) or M ($\sigma_1(\gamma) = M$); furthermore, she attaches probability $\frac{2}{3}$ to Player 1 playing B and $\frac{1}{3}$ to Player 1 playing M.

Are the two players’ choices rational at state $\beta$?

Consider first Player 1. Given his beliefs and his choice of B, his expected payoff is $\frac{1}{2}\pi_1(B,C) + \frac{1}{2}\pi_1(B,L) = \frac{1}{2}(2) + \frac{1}{2}(8) = 5$. Given his beliefs about Player 2, could he obtain a higher expected payoff with a different choice? With M his expected payoff would be $\frac{1}{2}\pi_1(M,C) + \frac{1}{2}\pi_1(M,L) = \frac{1}{2}(0) + \frac{1}{2}(0) = 0$, while with T his expected payoff would be $\frac{1}{2}\pi_1(T,C) + \frac{1}{2}\pi_1(T,L) = \frac{1}{2}(3) + \frac{1}{2}(4) = 3.5$. Thus, given his beliefs, Player 1’s choice of B is optimal and we can say that Player 1 is rational at state $\beta$.

Consider now Player 2. Given her beliefs and her choice of L, her expected payoff is $\frac{2}{3}\pi_2(B,L) + \frac{1}{3}\pi_2(M,L) = \frac{2}{3}(3) + \frac{1}{3}(9) = 5$. Given her beliefs about Player 1, could she obtain a higher expected payoff with a different choice? With C her expected payoff would be $\frac{2}{3}\pi_2(B,C) + \frac{1}{3}\pi_2(M,C) = \frac{2}{3}(4) + \frac{1}{3}(0) = \frac{8}{3}$, while with R her expected payoff would be $\frac{2}{3}\pi_2(B,R) + \frac{1}{3}\pi_2(M,R) = \frac{2}{3}(0) + \frac{1}{3}(12) = 4$. Thus, given her beliefs, Player 2’s choice of L is optimal and we can say that Player 2 is also rational at state $\beta$.

One  can  view  a strategic-form  game  as  only  a partial  description  of  an 
interactive  situation:  it  specifies  who  the  players  are,  what  actions  they  can  take 
and  how  they  rank  the  possible  outcomes.  A model  of  the  game  completes  this 
description  by  also  specifying  what  each  player  actually  does  and  what  she 
believes  about  what  the  other  players  are  going  to  do.  Once  we  know  what  a 
player  does  and  what  she  believes  then  we  are  in  a position  to  judge  her  choice 
to  be  either  rational  or  irrational. 

The  following  definition  says  that  at  a state  (of  a model)  a player  is  rational  if 
her  choice  at  that  state  is  optimal,  given  her  beliefs,  that  is,  if  there  is  no  other 
pure  strategy  that  would  give  her  a higher  expected  payoff,  given  what  she 
believes  about  the  choices  of  her  opponents. 

Given a state $w$ and a Player $i$, we denote by $\sigma_{-i}(w)$ the profile of strategies chosen at $w$ by the players other than $i$:

$$\sigma_{-i}(w) = (\sigma_1(w), \ldots, \sigma_{i-1}(w), \sigma_{i+1}(w), \ldots, \sigma_n(w)).$$

Furthermore, as in Chapter 8, we denote by $P_{i,w}$ the beliefs of individual $i$ at state $w$, that is, $P_{i,w}$ is a probability distribution over $I_i(w)$ (although we can think of $P_{i,w}$ as a probability distribution over the entire set of states $W$ satisfying the property that if $w' \notin I_i(w)$ then $P_{i,w}(w') = 0$). Note that, for every Player $i$, there is just one probability distribution over $I_i(w)$, that is, if $w' \in I_i(w)$ then $P_{i,w'} = P_{i,w}$.




Definition 9.2. Player $i$ is rational at state $w$ if, given her beliefs $P_{i,w}$, there is no pure strategy $s_i' \in S_i$ of hers that yields a higher expected payoff than $\sigma_i(w)$ (recall that $\sigma_i(w)$ is the strategy chosen by Player $i$ at state $w$):

$$\sum_{w' \in I_i(w)} P_{i,w}(w') \times \pi_i\big(\sigma_i(w), \sigma_{-i}(w')\big) \;\ge\; \sum_{w' \in I_i(w)} P_{i,w}(w') \times \pi_i\big(s_i', \sigma_{-i}(w')\big),$$

for all $s_i' \in S_i$.

Given a model of a game, using Definition 9.2 we can determine, for every player, the set of states where that player is rational. Let $R_i$ be the event that (that is, the set of states at which) Player $i$ is rational and let $R = R_1 \cap \dots \cap R_n$ be the event that all players are rational.

In the example of Figure 9.1 we have that $R_1 = \{\alpha,\beta\}$. Indeed, it was shown above that $\beta \in R_1$; since Player 1’s choice and beliefs at $\alpha$ are the same as at $\beta$, it follows that also $\alpha \in R_1$. On the other hand, Player 1 is not rational at state $\gamma$ because he is certain (that is, attaches probability 1 to the fact) that Player 2 plays R and his choice of M is not optimal against R (the unique best response to R is T). The same is true of state $\delta$. For Player 2 we have that $R_2 = \{\alpha,\beta,\gamma,\delta\}$, that is, Player 2 is rational at every state (the reader should convince himself/herself of this). Thus $R = \{\alpha,\beta\}$.
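Definition 9.2 is mechanical enough to be checked by a short program. The sketch below is an illustration only: the payoffs are those of Figure 9.2 (the same game), while the model (strategy assignments and beliefs at each state) is a reconstruction from the verbal description given above, so it should be read as an assumption rather than as a copy of Figure 9.1. In particular, it assumes that Player 1 attaches probability 1 to state $\delta$ at the information set $\{\gamma,\delta\}$ and that Player 2's information sets at $\alpha$ and $\delta$ are singletons. All identifiers are hypothetical.

```python
from fractions import Fraction as F

# Payoffs (Player 1, Player 2) for each profile (s1, s2), as in Figure 9.2.
PAYOFFS = {
    ("T", "L"): (4, 6), ("T", "C"): (3, 2), ("T", "R"): (8, 0),
    ("M", "L"): (0, 9), ("M", "C"): (0, 0), ("M", "R"): (4, 12),
    ("B", "L"): (8, 3), ("B", "C"): (2, 4), ("B", "R"): (0, 0),
}
STRATEGIES = {1: ["T", "M", "B"], 2: ["L", "C", "R"]}
STATES = ["alpha", "beta", "gamma", "delta"]

# sigma_i(w): strategy chosen by Player i at state w (assumed reconstruction).
SIGMA = {
    1: {"alpha": "B", "beta": "B", "gamma": "M", "delta": "M"},
    2: {"alpha": "C", "beta": "L", "gamma": "L", "delta": "R"},
}

# P_{i,w}: beliefs of Player i at state w, as a distribution over states.
P1_AB = {"alpha": F(1, 2), "beta": F(1, 2)}
P1_GD = {"delta": F(1)}                    # certain that Player 2 plays R
P2_BG = {"beta": F(2, 3), "gamma": F(1, 3)}
BELIEFS = {
    1: {"alpha": P1_AB, "beta": P1_AB, "gamma": P1_GD, "delta": P1_GD},
    2: {"alpha": {"alpha": F(1)}, "beta": P2_BG,
        "gamma": P2_BG, "delta": {"delta": F(1)}},
}

def payoff(i, own, other):
    """Payoff of Player i when she plays `own` and the opponent plays `other`."""
    profile = (own, other) if i == 1 else (other, own)
    return PAYOFFS[profile][i - 1]

def expected_payoff(i, strategy, w):
    """Expected payoff of `strategy` for Player i under her beliefs at w."""
    opponent = 3 - i
    return sum(p * payoff(i, strategy, SIGMA[opponent][v])
               for v, p in BELIEFS[i][w].items())

def is_rational(i, w):
    """Definition 9.2: no pure strategy does strictly better than sigma_i(w)."""
    chosen = expected_payoff(i, SIGMA[i][w], w)
    return all(chosen >= expected_payoff(i, s, w) for s in STRATEGIES[i])

def rational_states(i):
    return {w for w in STATES if is_rational(i, w)}

R1, R2 = rational_states(1), rational_states(2)
R = R1 & R2
```

Under these assumptions the sets `R1`, `R2` and `R` agree with the events $R_1$, $R_2$ and $R$ stated in the text.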

This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this  section,  by  going  through  the  exercises  in  Section  9.E.1  of  Appendix  9.E  at  the 
end  of  this  chapter. 


9.2  Common  knowledge  of  rationality 
in  strategic-form  games 

Since  we  have  defined  the  rationality  of  a player  and  the  rationality  of  all  the 
players  as  events,  we  can  apply  the  knowledge  operator  and  the  common 
knowledge  operator  to  these  events;  thus  we  can  determine,  for  example,  if 
there  are  any  states  where  the  rationality  of  all  the  players  is  common 
knowledge  and  see  what  strategy  profiles  are  compatible  with  common 
knowledge  of  rationality. 
REMARK 1. Note that, for every player $i$, $K_iR_i = R_i$, that is, every player is rational if and only if she knows it (this is a consequence of the assumption that a player always knows what strategy she has chosen and knows her own beliefs: if $w' \in I_i(w)$ then $\sigma_i(w') = \sigma_i(w)$ and $P_{i,w'} = P_{i,w}$). Note also that $CKR \subseteq R$, that is, if it is common knowledge that all the players are rational, then they are indeed rational.19
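To see the knowledge and common knowledge operators at work on the rationality events, consider the following sketch. It takes the events $R_1 = \{\alpha,\beta\}$ and $R_2 = W$ of the Figure 9.1 example as given and assumes the partitions $\{\{\alpha,\beta\},\{\gamma,\delta\}\}$ for Player 1 and $\{\{\alpha\},\{\beta,\gamma\},\{\delta\}\}$ for Player 2; only the cells $\{\alpha,\beta\}$ and $\{\beta,\gamma\}$ are stated explicitly in the text, the others are inferred. The function names are illustrative.

```python
W = {"alpha", "beta", "gamma", "delta"}
PARTITIONS = {
    1: [{"alpha", "beta"}, {"gamma", "delta"}],
    2: [{"alpha"}, {"beta", "gamma"}, {"delta"}],
}

def knows(partition, event):
    """K E: the states whose information set is contained in the event E."""
    return {w for cell in partition if cell <= event for w in cell}

def ck_partition(partitions):
    """Common knowledge partition: repeatedly merge overlapping cells."""
    cells = [set(c) for p in partitions for c in p]
    merged = True
    while merged:
        merged = False
        for x in range(len(cells)):
            for y in range(x + 1, len(cells)):
                if cells[x] & cells[y]:
                    other = cells.pop(y)
                    cells[x] |= other
                    merged = True
                    break
            if merged:
                break
    return cells

def common_knowledge(partitions, event):
    """CK E: the states whose common knowledge cell is contained in E."""
    return knows(ck_partition(partitions), event)

R1, R2 = {"alpha", "beta"}, set(W)
R = R1 & R2
K1R1 = knows(PARTITIONS[1], R1)                       # equals R1 (Remark 1)
CKR = common_knowledge(list(PARTITIONS.values()), R)  # empty in this model
```

In this model the common knowledge partition has a single cell, so `CKR` is empty even though `R` is not: the inclusion $CKR \subseteq R$ holds, but its converse fails.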

In Chapters 1 (for ordinal strategic-form games) and 5 (for cardinal strategic-form games) we claimed that the iterated deletion of strictly dominated
strategies  corresponds  to  the  notion  of  common  knowledge  of  rationality.  We 
are  now  able  to  state  this  precisely.  The  following  two  theorems  establish  the 
correspondence  between  the  two  notions  for  the  case  of  strategic-form  games 
with cardinal payoffs. A corresponding characterization holds for strategic-form games with ordinal payoffs.20 The proofs of Theorems 9.1 and 9.2 are
given in Appendix 9.A.

Theorem 9.1. Given a finite strategic-form game with cardinal payoffs $G$, the following is true: for any model of $G$ and any state $w$ in that model, if $w \in CKR$ (that is, at $w$ there is common knowledge of rationality) then the pure-strategy profile associated with $w$ must be one that survives the iterated deletion of strictly dominated strategies (Definition 5.6, Chapter 5).

Theorem 9.2. Given a finite strategic-form game with cardinal payoffs $G$, the following is true: if $s \in S$ is a pure-strategy profile that survives the iterated deletion of strictly dominated strategies, then there is a model of $G$ and a state $w$ in that model, such that $w \in CKR$ and the strategy profile associated with $w$ is $s$.

As  an  application  of  Theorem  9.1  consider  again  the  game  of  Figure  9.1, 
reproduced  below  in  Figure  9.2. 


19 Indeed, the common knowledge operator, like the individual knowledge operators, satisfies the Truth property (see Remark 7.1 and Exercise 7.5, Chapter 7). Note that, although $CKR \subseteq R$, the converse (that is, the inclusion $R \subseteq CKR$) is not true: it is possible for all the players to be rational at a state without this fact being common knowledge among them at that state. For example, this is the case at state $\alpha$ in the model of Figure 9.1.

20  See  Bonanno  (2015). 
                      Player 2
                  L         C         R
            T   4 , 6     3 , 2     8 , 0
Player 1    M   0 , 9     0 , 0     4 , 12
            B   8 , 3     2 , 4     0 , 0

Figure 9.2


In  this  game  the  iterated  deletion  of  strictly  dominated  strategies  leads  to  the 
following  strategy  profiles:  (T,  L),  (T,  C),  (B,  L)  and  (B,  C).21  Thus  these  are  the 
only  strategy  profiles  that  are  compatible  with  common  knowledge  of 
rationality.  Hence,  by  Theorem  9.1,  at  a state  in  a model  of  this  game  where 
there  is  common  knowledge  of  rationality  the  players  can  play  only  one  of 
these  strategy  profiles.  Furthermore,  by  Theorem  9.2,  any  of  these  four 
strategy  profiles  can  in  fact  be  played  in  a situation  where  there  is  common 
knowledge  of  rationality.  It  is  worth  stressing  that  common  knowledge  of 
rationality  does  not  imply  that  the  players  play  a Nash  equilibrium:  indeed, 
none  of  the  above  four  strategy  profiles  is  a Nash  equilibrium.  However,  since 
a pure-strategy  Nash  equilibrium  always  survives  the  iterated  deletion  of 
strictly  dominated  strategies,  the  pure-strategy  Nash  equilibria  (if  any)  are 
always  compatible  with  common  knowledge  of  rationality. 
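The iterated deletion procedure for the game of Figure 9.2 can be sketched in code. One caveat: the function `is_dominated` below only tests domination by mixtures of at most two pure strategies. This is exhaustive for the game at hand, where each player has three strategies, but in larger games it is only a sufficient test, and a complete check would require linear programming. The payoffs are those of Figure 9.2; all function names are illustrative.

```python
from fractions import Fraction
from itertools import combinations_with_replacement

# Payoffs (Player 1, Player 2) for each profile (row, column) of Figure 9.2.
PAYOFFS = {
    ("T", "L"): (4, 6), ("T", "C"): (3, 2), ("T", "R"): (8, 0),
    ("M", "L"): (0, 9), ("M", "C"): (0, 0), ("M", "R"): (4, 12),
    ("B", "L"): (8, 3), ("B", "C"): (2, 4), ("B", "R"): (0, 0),
}

def payoff(player, own, other):
    """Payoff of `player` (0 = row player, 1 = column player)."""
    profile = (own, other) if player == 0 else (other, own)
    return PAYOFFS[profile][player]

def is_dominated(player, target, own, others):
    """True if some mixture p*a + (1-p)*b of two (possibly equal) pure
    strategies strictly dominates `target` against every strategy in `others`."""
    candidates = [s for s in own if s != target]
    for a, b in combinations_with_replacement(candidates, 2):
        lo, hi = Fraction(0), Fraction(1)   # range of admissible p
        feasible = True
        for o in others:
            ua, ub, ut = (Fraction(payoff(player, s, o)) for s in (a, b, target))
            slope, rhs = ua - ub, ut - ub   # requirement: p * slope > rhs
            if slope == 0:
                feasible = feasible and rhs < 0
            elif slope > 0:
                lo = max(lo, rhs / slope)   # requirement: p > rhs / slope
            else:
                hi = min(hi, rhs / slope)   # requirement: p < rhs / slope
        if feasible and lo < hi:            # some p lies strictly in between
            return True
    return False

def iterated_deletion():
    """Delete strictly dominated strategies until nothing more can be deleted."""
    rows, cols = ["T", "M", "B"], ["L", "C", "R"]
    while True:
        new_rows = [r for r in rows if not is_dominated(0, r, rows, cols)]
        new_cols = [c for c in cols if not is_dominated(1, c, cols, rows)]
        if (new_rows, new_cols) == (rows, cols):
            return rows, cols
        rows, cols = new_rows, new_cols

def is_nash(r, c, rows, cols):
    """True if (r, c) is a pure-strategy Nash equilibrium."""
    return (all(payoff(0, r, c) >= payoff(0, x, c) for x in rows)
            and all(payoff(1, c, r) >= payoff(1, y, r) for y in cols))

surviving_rows, surviving_cols = iterated_deletion()
```

The surviving strategies are T and B for Player 1 and L and C for Player 2, and `is_nash` confirms that none of the four resulting profiles is a Nash equilibrium.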


This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this  section,  by  going  through  the  exercises  in  Section  9.E.2  of  Appendix  9.E  at  the 
end  of  this  chapter. 


21 For Player 1, M is strictly dominated by any mixed strategy that assigns probability $p$ to T and probability $1 - p$ to B, with $p > \frac{1}{2}$. After deletion of M, for Player 2 R becomes strictly dominated by either of the other two strategies.
9.3  Common  knowledge  of  rationality
in  extensive-form  games 

So  far  we  have  only  discussed  the  implications  of  common  knowledge  of 
rationality  in  strategic-form  games,  where  the  players  make  their  choices 
simultaneously  (or  in  ignorance  of  the  other  players’  choices).  We  now  turn  to 
a brief  discussion  of  the  issues  that  arise  when  one  attempts  to  determine  the 
implications  of  common  knowledge  of  rationality  in  dynamic  (or  extensive- 
form)  games. 

How should a model of a dynamic game be constructed? One approach in the literature has been to consider models of the corresponding strategic form. However, there are several conceptual issues that arise in this context. In the models considered in Section 9.1 the interpretation of $s_i = \sigma_i(w)$ is that at state $w$ player $i$ “plays” or “chooses” strategy $s_i$. Now consider the perfect-information game shown in Figure 9.3 below and a model of the strategic-form game associated with it. Let $w$ be a state in that model where $\sigma_1(w) = (d_1,a_3)$. What does it mean to say that Player 1 “chooses” strategy $(d_1,a_3)$? The first part of the strategy, namely $d_1$, can be interpreted as a description of Player 1’s actual behavior (what he actually does: he plays $d_1$), but the second part of the strategy, namely $a_3$, has no such interpretation: if Player 1 in fact plays $d_1$ then he knows that he will not have to make any further choices and thus it is not clear what it means for him to “choose” to play $a_3$ in a situation that is made impossible by his decision to play $d_1$.


1 0 3 

0 2 1 


Figure  9.3 
Thus it does not seem to make sense to interpret $\sigma_1(w) = (d_1,a_3)$ as ‘at state $w$ Player 1 chooses $(d_1,a_3)$’. Perhaps the correct interpretation is in terms of a more complex sentence such as ‘Player 1 chooses to play $d_1$ and if - contrary to this plan - he were to play $a_1$ and Player 2 were to follow with $a_2$, then Player 1 would play $a_3$’. Thus, while in a simultaneous game the association of a strategy of Player $i$ to a state can be interpreted as a description of Player $i$’s actual behavior at that state, in the case of dynamic games this interpretation is no longer valid, since one would end up describing not only the actual behavior of Player $i$ at that state but also his counterfactual behavior. Methodologically, this is not satisfactory: if it is considered to be necessary to specify what a player would do in situations that do not occur at the state under consideration, then one should model the counterfactual explicitly. But why should it be necessary to specify at state $w$ (where Player 1 is playing $d_1$) what he would do at the counterfactual node following actions $a_1$ and $a_2$? Perhaps what matters is not so much what Player 1 would actually do in that situation but what Player 2 believes that Player 1 would do: after all, Player 2 might not know that Player 1 has decided to play $d_1$ and she needs to consider what to do in the eventuality that Player 1 actually ends up playing $a_1$. So, perhaps, the strategy of Player 1 is to be interpreted as having two components: (1) a description of Player 1’s behavior and (2) a conjecture in the mind of Player 2 about what Player 1 would do. If this is the correct interpretation, then one could object - again from a methodological point of view - that it would be preferable to disentangle the two components and model them explicitly.22

An  alternative  - although  less  common  - approach  in  the  literature  dispenses 
with strategies and considers models of games where (1) states are described in
terms  of  players’  actual  behavior  and  (2)  players’  conjectures  concerning  the 
actions  of  their  opponents  in  various  hypothetical  situations  are  modeled  by 
means  of  a generalization  of  the  knowledge-belief  structures  considered  in 
Section  9.1.  The  generalization  is  obtained  by  encoding  not  only  the  initial 
beliefs  of  the  players  (at  each  state)  but  also  their  dispositions  to  revise  those 
beliefs  under  various  hypotheses.23 


22  For  an  extensive  discussion  of  these  issues  see  Bonanno  (forthcoming). 

23  The  interested  reader  is  referred  to  Perea  (2012). 
A third  approach  has  been  to  move  away  from  static  models  (like  the  models
considered  in  Section  9.1)  and  consider  dynamic  models  where  time  is 
introduced  explicitly  into  the  analysis.  These  are  behavioral  models  where 
strategies  play  no  role  and  the  only  beliefs  that  are  specified  are  the  actual 
beliefs  of  the  players  at  the  time  of  choice.  Thus  players’  beliefs  are  modeled  as 
temporal,  rather  than  conditional,  beliefs  and  rationality  is  defined  in  terms  of 
actual  choices,  rather  than  hypothetical  plans.24 

A discussion  of  the  implications  of  common  knowledge  (or  common  belief)  of 
rationality  in  dynamic  games  would  require  the  introduction  of  several  new 
concepts  and  definitions.  For  the  sake  of  brevity,  we  shall  not  pursue  this  topic 
in  this  book. 


24 The interested reader is referred to Bonanno (2014).
Appendix 9.A: Proofs

In  order  to  prove  Theorem  9.1  we  will  need  the  following  extension  of 
Theorem  5.3  (Chapter  5)  to  the  case  of  an  arbitrary  number  of  players.  A proof 
of  Theorem  9.3  can  be  found  in  Osborne  and  Rubinstein  (1994,  Lemma  60.1,  p. 
60). 

Theorem 9.3. Consider a finite $n$-player game in strategic form with cardinal payoffs $\langle I, (S_i)_{i \in I}, (\pi_i)_{i \in I} \rangle$. Select an arbitrary Player $i$ and a pure strategy $s_i$ of Player $i$. For every Player $j \neq i$, let $S_j' \subseteq S_j$ be a subset of $j$’s set of pure strategies and let $S_{-i}' = S_1' \times \dots \times S_{i-1}' \times S_{i+1}' \times \dots \times S_n'$ be the Cartesian product of the sets $S_j'$ ($j \neq i$). Then the following are equivalent:

(1) There is a belief of Player $i$ on $S_{-i}'$ (that is, a probability distribution $P : S_{-i}' \to [0,1]$) that makes $s_i$ a best response, that is, for every $x \in S_i$,

$$\sum_{s_{-i}' \in S_{-i}'} \pi_i(s_i, s_{-i}')\,P(s_{-i}') \;\ge\; \sum_{s_{-i}' \in S_{-i}'} \pi_i(x, s_{-i}')\,P(s_{-i}').$$

(2) $s_i$ is not strictly dominated by a mixed strategy of Player $i$ in the restriction of the game to the sets of pure strategies $S_1', \ldots, S_{i-1}', S_i, S_{i+1}', \ldots, S_n'$.

Given a finite strategic-form game with cardinal payoffs $G = \langle I, (S_i)_{i \in I}, (\pi_i)_{i \in I} \rangle$ we shall denote by $S_i^{\infty}$ the set of pure strategies of Player $i$ that survive the iterated deletion of strictly dominated strategies (Definition 5.6, Chapter 5) and by $S^{\infty}$ the corresponding set of strategy profiles (that is, $S^{\infty} = S_1^{\infty} \times \dots \times S_n^{\infty}$).

Proof of Theorem 9.1. Consider a finite strategic-form game $G$ with cardinal payoffs and an epistemic model of $G$. Let $w$ be a state in that model such that $w \in CKR$. We want to show that $\sigma(w) = (\sigma_1(w), \ldots, \sigma_n(w)) \in S^{\infty}$. We shall prove the stronger claim that, for every Player $i$ and for every $w' \in I_{CK}(w)$ (recall that $I_{CK}(w)$ is the cell of the common knowledge partition that contains $w$) and for every $m \ge 0$, $\sigma_i(w') \notin D_i^m$, where $D_i^m \subseteq S_i$ is the set of pure strategies of Player $i$ that are strictly dominated in game $G^m$ ($G^m$ is the subgame of $G$ obtained at step $m$ of the iterated deletion procedure; we define $G^0$ to be $G$ itself). Note that this is equivalent to stating that, for every $w' \in I_{CK}(w)$, $\sigma(w') \in S^{\infty}$. Since $w \in I_{CK}(w)$, it follows from this that $\sigma(w) \in S^{\infty}$. The proof is by induction.

1. BASE STEP ($m = 0$). Select an arbitrary $w' \in I_{CK}(w)$ and an arbitrary Player $i$. Since $w \in CKR$, $I_{CK}(w) \subseteq R$ and thus $w' \in R$; furthermore, since $R \subseteq R_i$, $w' \in R_i$. Thus $\sigma_i(w')$ is a best response to the beliefs of Player $i$ at state $w'$. Hence, by Theorem 9.3, $\sigma_i(w') \notin D_i^0$.

2. INDUCTIVE STEP: assuming that the statement is true for all $k \le m$ (for some $m \ge 0$) we prove that it is true for $k+1$. The hypothesis is that, for every Player $i$ and for every $w' \in I_{CK}(w)$, $\sigma_i(w') \notin D_i^k$; that is, for every $w' \in I_{CK}(w)$, $\sigma(w') \in S^{k+1}$. Select an arbitrary $w' \in I_{CK}(w)$ and an arbitrary Player $i$. Since $w \in CKR$, $w' \in R_i$ and thus $\sigma_i(w')$ is a best reply to Player $i$’s beliefs at $w'$ which, by hypothesis, attach positive probability only to strategy profiles of the other players that belong to $S_{-i}^{k+1}$ (note that, since $w' \in I_{CK}(w)$, $I_{CK}(w') = I_{CK}(w)$ and, by definition of common knowledge partition, $I_i(w') \subseteq I_{CK}(w')$). By Theorem 9.3, it follows that $\sigma_i(w') \notin D_i^{k+1}$.


Proof of Theorem 9.2. Given a strategic-form game with cardinal payoffs $G = \langle I, (S_i)_{i \in I}, (\pi_i)_{i \in I} \rangle$, construct the following epistemic model of $G$: $W = S^{\infty}$ and, for every player $i$ and every $s \in S^{\infty}$, $I_i(s) = \{s' \in S^{\infty} : s_i' = s_i\}$ (that is, the information set of Player $i$ that contains $s$ consists of all the strategy profiles in $S^{\infty}$ that have the same strategy of Player $i$ as in $s$); finally, let $\sigma_i(s) = s_i$. Select an arbitrary $s \in S^{\infty}$ and an arbitrary Player $i$. By definition of $S^{\infty}$, it is not the case that $s_i$ is strictly dominated in the restriction of $G$ to the sets of pure strategies $S_1^{\infty}, \ldots, S_{i-1}^{\infty}, S_i, S_{i+1}^{\infty}, \ldots, S_n^{\infty}$. Thus, by Theorem 9.3, there is a probability distribution over $S_{-i}^{\infty}$ that makes $s_i = \sigma_i(s)$ a best reply. Choose one such probability distribution and let that probability distribution give Player $i$’s beliefs at $s$. Then $s \in R_i$. Since $i$ was chosen arbitrarily, $s \in R$; hence, since $s \in S^{\infty}$ was chosen arbitrarily, $R = S^{\infty}$. It follows that $s \in CKR$ for every $s \in S^{\infty}$.
Appendix 9.E: Exercises


9.E.1. Exercises for Section 9.1:

Models of strategic-form games

The answers to the following exercises are in Appendix 9.S at the end of this
chapter.


Exercise 9.1. Consider the following game (where the payoffs are von Neumann-Morgenstern payoffs)

                      Player 2
                  L         M         R
            A   3 , 5     2 , 0     2 , 2
Player 1    B   5 , 2     1 , 2     2 , 1
            C   9 , 0     1 , 5     3 , 2

and the following model of this game

state:             a     b     c     d
1's strategy:      C     C     A     A
2's strategy:      M     L     L     R

Player 1's information sets: {a, b} and {c, d}, with probability 1/2 on each state in both sets.
Player 2's information sets: {a}, {b, c} (with probability 1/2 on each state) and {d}.

(a) Find the event $R_1$ (that is, the set of states where Player 1 is rational).

(b) Find the event $R_2$ (that is, the set of states where Player 2 is rational).

(c) Find the event $R$ (that is, the set of states where both players are rational).




Exercise  9.2.  Consider  the  following  three-player  game  and  model  of  it. 

Player  2 Player  2 


CD  CD 


Player  A 

2,3,2 

1,0,4 

A 

o 

o 

u> 

2,9,7 

1 B 

3,6,4 

0,8,0 

B 

2,1,3 

0,3,1 

Player  3 chooses  E Player  3 chooses  F 


1: 


a | Pi 

V J 

y | § 1 

S > 

f ■v. 

SO  £ 1 

L ^ 

2: 


a 


P 


Yi 


§1 


P - 

b 3 


PI 

S 2 

°4 

Y|  §{ 

state:    α   β   γ   δ   ε   ζ
σ_1:      A   A   B   B   B   B
σ_2:      C   D   C   D   C   D
σ_3:      E   F   E   E   F   F


(a) Find the event $R_1$ (that is, the set of states where Player 1 is rational).

(b) Find the event $R_2$ (that is, the set of states where Player 2 is rational).

(c) Find the event $R_3$ (that is, the set of states where Player 3 is rational).

(d) Find the event $R$ (that is, the set of states where all players are rational).




9.E.2.  Exercises  for  Section  9.2: 

Common  knowledge  of  rationality 

The answers to the following exercises are in Appendix 9.S at the end of this
chapter. 


Exercise 9.3. (a) For the game and model of Exercise 9.1 find the following events: (i) $K_1R_2$, (ii) $K_2R_1$, (iii) $K_2K_1R_2$, (iv) $K_1K_2R_1$, (v) $CKR_1$, (vi) $CKR_2$, (vii) $CKR$.

(b) Suppose that you found a model and a state $w$ in that model such that $w \in CKR$. What strategy profile could you find at $w$?


Exercise  9.4.  (a)  For  the  game  and  model  of  Exercise  9.2  find  the  following 
events:  (t)  K R , (it)  K2R , (iii)  K.R , (iv)  CKR, , (v)  CKR . 

(b) Suppose that you found a model and a state $w$ in that model such that $w \in CKR$. What strategy profile could you find at $w$?


Exercise  9.5.  For  the  game  of  Exercise  9.1  construct  a model  where  there  is  a 
state  at  which  there  is  common  knowledge  of  rationality  and  the  strategy 
profile  played  there  is  (C,L).  [Hints:  (1)  four  states  are  sufficient,  (2)  it  is  easiest 
to postulate degenerate beliefs where a player assigns probability 1 to a
particular  state  in  his  information  set.] 


Exercise  9.6.  For  the  game  of  Exercise  9.2  construct  a model  where  there  is  a 
state  at  which  there  is  common  knowledge  of  rationality.  [Hint:  think 
carefully,  you  don’t  need  many  states!] 


Exercise  9.7  (Challenging  Question).  Consider  the  following  extensive-form 
game  (where  o1,o2,o3  and  o4  are  the  possible  outcomes) 


1 


A 


2 


D 


d 


i 

°\ 


1 

02 


G 


i 


c 


6 

6 

6 


4 

4 

4 


2 

2 

2 


0 Player  l's  payoff 
0 Player  2's  payoff 
0 Player  3's  payoff 


(a)  What  is  the  backward-induction  outcome? 

(b)  Write  the  strategic-form  game  associated  with  the  extensive-form  game  and 
find  all  the  outcomes  that  are  supported  by  a pure-strategy  Nash 
equilibrium. 

(c)  Considering  models  of  the  strategic  form,  what  strategy  profiles  are 
compatible  with  common  knowledge  of  rationality? 

(d)  Choose  a strategy  profile  which  is  not  a Nash  equilibrium  and  construct  a 
model  of  the  strategic  form  where  at  a state  there  is  common  knowledge  of 
rationality  and  the  associated  strategy  profile  is  the  one  you  selected. 
Appendix 9.S: Solutions to exercises

Exercise 9.1. (a) $R_1 = \{a,b\}$. At states a and b Player 1 believes that Player 2 is
playing either L or M with equal probability. Thus his expected payoff from
playing A is $\tfrac12(3) + \tfrac12(2) = 2.5$, from playing B is $\tfrac12(5) + \tfrac12(1) = 3$ and from
playing C is $\tfrac12(9) + \tfrac12(1) = 5$. Hence the best choice is C and this is indeed his
choice at those two states. Thus $a,b \in R_1$. At states c and d Player 1 plays A
with an expected payoff of $\tfrac12(3) + \tfrac12(2) = 2.5$ but he could get a higher expected
payoff with C (namely $\tfrac12(9) + \tfrac12(3) = 6$); thus $c,d \notin R_1$. Hence $R_1 = \{a,b\}$.

(b) $R_2 = \{a,b,c\}$. (c) $R = \{a,b\}$.

Exercise 9.2. (a) For each state let us calculate Player 1’s expected payoff
(denoted by $E\pi_1$) from playing A and from playing B. At states $\alpha$ and $\beta$,

$E\pi_1(A) = \tfrac12\pi_1(A,C,E) + \tfrac12\pi_1(A,D,F) = \tfrac12(2) + \tfrac12(2) = 2$,

$E\pi_1(B) = \tfrac12\pi_1(B,C,E) + \tfrac12\pi_1(B,D,F) = \tfrac12(3) + \tfrac12(0) = 1.5$.

Thus A is optimal and hence, since $\sigma_1(\alpha) = \sigma_1(\beta) = A$, $\alpha,\beta \in R_1$.

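At states $\gamma$ and $\delta$, $E\pi_1(A)$ is computed from $\pi_1(A,C,E) = 2$ and $\pi_1(A,D,E) = 1$,
and $E\pi_1(B)$ from $\pi_1(B,C,E) = 3$ and $\pi_1(B,D,E) = 0$, with the weights given by
Player 1’s beliefs. Thus B is optimal and
hence, since $\sigma_1(\gamma) = \sigma_1(\delta) = B$, $\gamma,\delta \in R_1$.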

At states $\varepsilon$ and $\zeta$ Player 1 assigns probability 1 to (D,F) against which A is the
unique best reply and yet $\sigma_1(\varepsilon) = \sigma_1(\zeta) = B$; hence $\varepsilon,\zeta \notin R_1$.

Thus $R_1 = \{\alpha,\beta,\gamma,\delta\}$.

(b) For each state let us calculate Player 2’s expected payoff (denoted by $E\pi_2$)
from playing C and from playing D. At states $\alpha$, $\gamma$ and $\varepsilon$, $E\pi_2(C)$ is computed
from $\pi_2(A,C,E) = 3$, $\pi_2(B,C,E) = 6$ and $\pi_2(B,C,F) = 1$, while $E\pi_2(D)$ is computed
from $\pi_2(A,D,E) = 0$, $\pi_2(B,D,E) = 8$ and $\pi_2(B,D,F) = 3$, with the weights given by
Player 2’s beliefs.
Thus D is optimal and hence, since $\sigma_2(\alpha) = \sigma_2(\gamma) = \sigma_2(\varepsilon) = C$, $\alpha,\gamma,\varepsilon \notin R_2$.

At states $\beta$, $\delta$ and $\zeta$, $E\pi_2(C)$ is computed from $\pi_2(A,C,F) = 0$, $\pi_2(B,C,E) = 6$ and
$\pi_2(B,C,F) = 1$, while $E\pi_2(D)$ is computed from $\pi_2(A,D,F) = 9$, $\pi_2(B,D,E) = 8$ and
$\pi_2(B,D,F) = 3$.

Thus D is optimal and hence, since $\sigma_2(\beta) = \sigma_2(\delta) = \sigma_2(\zeta) = D$, $\beta,\delta,\zeta \in R_2$.
Thus $R_2 = \{\beta,\delta,\zeta\}$.
(c) For each state let us calculate Player 3’s expected payoff (denoted by $E\pi_3$)
from playing E and from playing F. At states $\alpha$, $\gamma$ and $\delta$, weighting the payoffs
at (A,C), (B,C) and (B,D) by Player 3’s beliefs gives $E\pi_3(E) = 2.5$ and
$E\pi_3(F) = 2.5$.
Thus both E and F are optimal and hence $\alpha,\gamma,\delta \in R_3$.

At states $\beta$ and $\varepsilon$,

$E\pi_3(E) = \tfrac12\pi_3(A,D,E) + \tfrac12\pi_3(B,C,E) = \tfrac12(4) + \tfrac12(4) = 4$,

$E\pi_3(F) = \tfrac12\pi_3(A,D,F) + \tfrac12\pi_3(B,C,F) = \tfrac12(7) + \tfrac12(3) = 5$.

Thus F is optimal and hence, since $\sigma_3(\beta) = \sigma_3(\varepsilon) = F$, $\beta,\varepsilon \in R_3$.

At state $\zeta$ Player 3 knows that Players 1 and 2 play (B,D) and she is best
replying with F. Thus $\zeta \in R_3$. Hence $R_3 = \{\alpha,\beta,\gamma,\delta,\varepsilon,\zeta\}$.

(d) $R = R_1 \cap R_2 \cap R_3 = \{\beta,\delta\}$.


Exercise 9.3. In Exercise 9.1 we determined that $R_1 = \{a,b\}$, $R_2 = \{a,b,c\}$ and
$R = \{a,b\}$. Thus

(a) (i) $K_1R_2 = \{a,b\}$, (ii) $K_2R_1 = \{a\}$, (iii) $K_2K_1R_2 = \{a\}$, (iv) $K_1K_2R_1 = \emptyset$. The
common knowledge partition consists of a single information set containing all
the states. Thus (v) $CKR_1 = \emptyset$, (vi) $CKR_2 = \emptyset$, (vii) $CKR = \emptyset$.


(b) By Theorem 9.1, at a state at which there is common knowledge of
rationality one can only find a strategy profile that survives the iterated
deletion of strictly dominated strategies. In this game, for Player 1 strategy B is
strictly dominated by the mixed strategy $\begin{pmatrix} A & C \\ \tfrac12 & \tfrac12 \end{pmatrix}$; after deleting B, for Player
2 strategy R is strictly dominated by the mixed strategy $\begin{pmatrix} L & M \\ \tfrac12 & \tfrac12 \end{pmatrix}$. Thus the
iterated deletion of strictly dominated strategies yields the set of strategy
profiles {(A,L),(A,M),(C,L),(C,M)}. Hence at a state where there is common
knowledge of rationality one could only find one of these four strategy profiles.
Exercise 9.4. (a) (i) $K_1R = \emptyset$, (ii) $K_2R = \emptyset$, (iii) $K_3R = \emptyset$. The common
knowledge partition consists of a single information set containing all the
states. Thus (iv) $CKR_3 = \{\alpha,\beta,\gamma,\delta,\varepsilon,\zeta\}$, (v) $CKR = \emptyset$.

(b)  By  Theorem  9.1,  at  a state  at  which  there  is  common  knowledge  of 
rationality  one  can  only  find  a strategy  profile  that  survives  the  iterated 
deletion  of  strictly  dominated  strategies.  Since  in  this  game  there  are  no  strictly 
dominated  strategies,  at  such  a state  one  could  find  any  strategy  profile. 

Exercise  9.5.  In  the  following  model  there  is  common  knowledge  of 
rationality  at  every  state.  At  state  a the  strategy  profile  played  is  ( C,L ). 


$\sigma_1$: C A A C

$\sigma_2$: L L M M
Exercise 9.6. Whenever a game has a Nash equilibrium in pure strategies, a
one-state  model  where  that  Nash  equilibrium  is  played  is  such  that  there  is 
common  knowledge  of  rationality  at  that  state.  Thus  the  following  model 
provides  a possible  answer: 


A 

D 

F 


Exercise  9.7  (Challenging  Question). 

(a) The backward induction outcome is $o_4$, with associated strategy profile
(A,a,C) and payoffs (6,6,6).

(b) The strategic form is as follows (the Nash equilibria are marked with an asterisk):

Player 3 chooses G:

                     Player 2
                   d           a
Player 1    D   4,4,4 *     4,4,4 *
            A   2,2,2       0,0,0

Player 3 chooses C:

                     Player 2
                   d           a
Player 1    D   4,4,4 *     4,4,4
            A   2,2,2       6,6,6 *
The Nash equilibria are: (D,d,G), (D,a,G), (D,d,C) and (A,a,C). Thus the
only two outcomes sustained by a Nash equilibrium are $o_1$, with payoffs
(4,4,4), and $o_4$, with payoffs (6,6,6).
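The list of equilibria can be checked by brute force. The following is a minimal sketch, not part of the original solution: it encodes the game tree of Exercise 9.7 as a payoff function (the names `payoffs` and `is_nash` are our own) and tests every pure-strategy profile for profitable unilateral deviations.

```python
from itertools import product

# Pure strategies of Players 1, 2 and 3 in the game of Exercise 9.7.
S1, S2, S3 = ("D", "A"), ("d", "a"), ("G", "C")
STRATEGY_SETS = (S1, S2, S3)

def payoffs(s1, s2, s3):
    """Payoff vector of the outcome reached by the profile (s1, s2, s3)."""
    if s1 == "D":
        return (4, 4, 4)   # outcome o1
    if s2 == "d":
        return (2, 2, 2)   # outcome o2
    if s3 == "G":
        return (0, 0, 0)   # outcome o3
    return (6, 6, 6)       # outcome o4

def is_nash(profile):
    """True if no player gains by changing only his own strategy."""
    for i, own_strategies in enumerate(STRATEGY_SETS):
        for alternative in own_strategies:
            deviation = list(profile)
            deviation[i] = alternative
            if payoffs(*deviation)[i] > payoffs(*profile)[i]:
                return False
    return True

nash_equilibria = [p for p in product(S1, S2, S3) if is_nash(p)]
nash_outcomes = {payoffs(*p) for p in nash_equilibria}
```

Under this encoding `nash_equilibria` contains exactly the four profiles listed above and `nash_outcomes` contains only the payoff vectors of $o_1$ and $o_4$.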

(c)  By  Theorem  9.1,  the  outcomes  that  are  compatible  with  common 
knowledge  of  rationality  are  those  associated  with  strategy  profiles  that 
survive  the  iterated  deletion  of  strictly  dominated  strategies.  Since  no 
player  has  any  strictly  dominated  strategies,  all  the  outcomes  are 
compatible  with  common  knowledge  of  rationality. 

(d) In the following model at state $\alpha$ the players choose (A,d,G), which is not
a Nash equilibrium; furthermore, there is no Nash equilibrium whose
associated outcome is $o_2$ (with payoffs (2,2,2)). In this model $\alpha \in CKR$ (in
particular, Player 1’s choice of A is rational, given his belief that Player 2
plays d and a with equal probability).

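[Figure: model with two states, $\alpha$ and $\beta$. Player 1’s information set is
$\{\alpha,\beta\}$; Players 2 and 3 each have information sets $\{\alpha\}$ and $\{\beta\}$; the common
knowledge partition is $\{\alpha,\beta\}$. The strategy profile is (A,d,G) at $\alpha$ and
(A,a,C) at $\beta$.]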
PART IV


Advanced  Topics  II: 

Refinements  of 
Subgame-Perfect  Equilibrium 
A First Attempt:

Weak  Sequential  Equilibrium 

10.1  Assessments  and  sequential  rationality 

At the end of Chapter 6 (Section 6.3) we showed that, although the
notion of subgame-perfect equilibrium is a refinement of Nash
equilibrium, it is not strong enough to eliminate all “unreasonable”
Nash equilibria. One reason for this is that a subgame-perfect equilibrium
$\sigma$ allows a player’s strategy to include a strictly dominated choice at an
information set that is not reached by the play induced by $\sigma$. In order to
eliminate such possibilities we need to define the notion of equilibrium for
dynamic games in terms of a more complex object than merely a strategy
profile.

Definition 10.1. Given an extensive-form game G, an assessment for G is a pair
$(\sigma,\mu)$, where $\sigma$ is a profile of behavioral strategies and $\mu$ is a list of probability
distributions, one for every information set, over the nodes in that information
set. We call $\mu$ a system of beliefs.

The system of beliefs $\mu$ specifies, for every information set, the beliefs - about
past moves - that the relevant player would have if informed that her
information set had been reached. Consider, for example, the extensive-form
game of Figure 10.1.
[Figure 10.1: extensive-form game. The three rows of payoffs at the terminal
nodes read: 0 4 0 1 3 1 0 1; 3 6 3 0 1 3 2 1; 0 4 0 1 3 1 0 1.]

Figure 10.1

A possible assessment for this game is $(\sigma,\mu)$ with

$$\sigma = \left(\begin{array}{ccc|cc|cc|cc} a & b & c & f & g & d & e & h & k \\ \tfrac18 & \tfrac38 & \tfrac48 & 1 & 0 & \tfrac34 & \tfrac14 & \tfrac15 & \tfrac45 \end{array}\right) \quad\text{and}\quad \mu = \left(\begin{array}{cc|cc} x & y & w & z \\ \tfrac23 & \tfrac13 & \tfrac12 & \tfrac12 \end{array}\right);$$

note that, typically, we will not bother to include in $\mu$ the trivial probability
distributions over singleton information sets.¹ The interpretation of this
assessment is that Player 1 plans to play a with probability $\tfrac18$, b with
probability $\tfrac38$ and c with probability $\tfrac48$; Player 2 plans to play f if her
information set {x,y} is reached and to mix between d and e with probabilities
$\tfrac34$ and $\tfrac14$, respectively, if her decision node t is reached; Player 3 plans to
mix between h and k with probabilities $\tfrac15$ and $\tfrac45$, respectively, if his
information set {w,z} is reached. Furthermore, Player 2 - if informed that her
information set {x,y} had been reached - would attach probability $\tfrac23$ to
Player 1 having played a (node x) and probability $\tfrac13$ to Player 1 having played
b (node y); Player 3 - if informed that his information set {w,z} had been
reached - would attach probability $\tfrac12$ to node w (that is, to the sequence of
moves cd) and probability $\tfrac12$ to node z (that is, to the sequence of moves ce).

1 A complete specification of $\mu$ would be $\mu = \left(\begin{array}{c|c|cc|cc} s & t & x & y & w & z \\ 1 & 1 & \tfrac23 & \tfrac13 & \tfrac12 & \tfrac12 \end{array}\right)$ where s is the decision node
of Player 1 (the root) and t is the decision node of Player 2 following choice c.


In order for an assessment $(\sigma,\mu)$ to be considered “reasonable” we will impose
two requirements:

1. The choices specified by $\sigma$ should be optimal given the beliefs specified
by $\mu$. We call this requirement sequential rationality.

2. The beliefs specified by $\mu$ should be consistent with the strategy profile
$\sigma$. We call this requirement Bayesian updating.


Before we give a precise definition of these concepts, we shall illustrate them
with reference to the above assessment

$$\sigma = \left(\begin{array}{ccc|cc|cc|cc} a & b & c & f & g & d & e & h & k \\ \tfrac18 & \tfrac38 & \tfrac48 & 1 & 0 & \tfrac34 & \tfrac14 & \tfrac15 & \tfrac45 \end{array}\right) \quad\text{and}\quad \mu = \left(\begin{array}{cc|cc} x & y & w & z \\ \tfrac23 & \tfrac13 & \tfrac12 & \tfrac12 \end{array}\right)$$

for the game of Figure 10.1. This assessment fails to satisfy sequential
rationality, because, for example, at Player 3’s information set {w,z} the
planned mixed strategy $\begin{pmatrix} h & k \\ \tfrac15 & \tfrac45 \end{pmatrix}$ yields Player 3 - given her beliefs
$\begin{pmatrix} w & z \\ \tfrac12 & \tfrac12 \end{pmatrix}$ - an expected payoff of
$\tfrac12\left[\tfrac15(3) + \tfrac45(1)\right] + \tfrac12\left[\tfrac15(0) + \tfrac45(1)\right] = \tfrac{11}{10}$ while she
could get a higher expected payoff, namely $\tfrac12(3) + \tfrac12(0) = \tfrac{15}{10}$, with the pure
strategy h. This assessment also fails the rule for belief updating (Definition 8.3,
Chapter 8); for example, given $\sigma$, the prior probability that node x is reached is
$P(x) = \tfrac18$ (it is the probability with which Player 1 plays a) and the prior
probability that node y is reached is $P(y) = \tfrac38$ (the probability with which
Player 1 plays b), so that updating on information {x,y} one gets

$$P(x \mid \{x,y\}) = \frac{P(x)}{P(\{x,y\})} = \frac{P(x)}{P(x) + P(y)} = \frac{\tfrac18}{\tfrac18 + \tfrac38} = \tfrac14$$

and $P(y \mid \{x,y\}) = \tfrac34$. Thus, in order to be consistent with the rule for belief
updating, Player 2’s beliefs should be $\begin{pmatrix} x & y \\ \tfrac14 & \tfrac34 \end{pmatrix}$.
Now we can turn to the formal definitions. First we need to introduce some
notation. If $\sigma$ is a profile of behavior strategies and a is a choice of Player i (at
some information set of Player i), we denote by $\sigma(a)$ the probability that $\sigma_i$
(the strategy of Player i that is part of $\sigma$) assigns to a. For example, for the
game of Figure 10.1, if

$$\sigma = \left(\begin{array}{ccc|cc|cc|cc} a & b & c & f & g & d & e & h & k \\ \tfrac18 & \tfrac38 & \tfrac48 & 1 & 0 & \tfrac34 & \tfrac14 & \tfrac15 & \tfrac45 \end{array}\right)$$

then $\sigma(b) = \tfrac38$, $\sigma(g) = 0$, $\sigma(d) = \tfrac34$, etc. Similarly, if $\mu$ is a system of beliefs and x is a decision
node, then we denote by $\mu(x)$ the probability that the relevant part of
$\mu$ assigns to x. For example, if

$$\mu = \left(\begin{array}{c|c|cc|cc} s & t & x & y & w & z \\ 1 & 1 & \tfrac23 & \tfrac13 & \tfrac12 & \tfrac12 \end{array}\right)$$

or, written more succinctly,

$$\mu = \left(\begin{array}{cc|cc} x & y & w & z \\ \tfrac23 & \tfrac13 & \tfrac12 & \tfrac12 \end{array}\right)$$

then $\mu(s) = 1$, $\mu(y) = \tfrac13$, $\mu(w) = \tfrac12$, etc.


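Recall that Z denotes the set of terminal nodes and, for every Player i,
$\pi_i : Z \to \mathbb{R}$ is the payoff function of Player i. Given a decision node x, let
$Z(x) \subseteq Z$ be the set of terminal nodes that can be reached starting from x. For
example, in the game of Figure 10.1, $Z(t) = \{z_5, z_6, z_7, z_8\}$. Given a behavior
strategy profile $\sigma$ and a decision node x, let $P_{x,\sigma}$ be the probability distribution
over $Z(x)$ induced by $\sigma$, that is, if $z \in Z(x)$ and $(a_1,\dots,a_m)$ is the sequence of
choices that leads from x to z then $P_{x,\sigma}(z)$ is the product of the probabilities of
those choices: $P_{x,\sigma}(z) = \sigma(a_1) \times \sigma(a_2) \times \dots \times \sigma(a_m)$. For example, in the game of
Figure 10.1, if $\sigma$ is the strategy profile

$$\sigma = \left(\begin{array}{ccc|cc|cc|cc} a & b & c & f & g & d & e & h & k \\ \tfrac18 & \tfrac38 & \tfrac48 & 1 & 0 & \tfrac34 & \tfrac14 & \tfrac15 & \tfrac45 \end{array}\right)$$

and t is Player 2’s decision node after choice c of Player 1, then
$P_{t,\sigma}(z_5) = \sigma(d)\sigma(h) = \tfrac34\left(\tfrac15\right) = \tfrac{3}{20}$. If H is an information set of Player i we
denote by $\pi_i(H \mid \sigma,\mu)$ the expected payoff of Player i starting from
information set H, given the beliefs specified by $\mu$ at H and given the choices
prescribed by $\sigma$ at H and at the information sets that come after H, that is,

$$\pi_i(H \mid \sigma,\mu) = \sum_{x \in H} \mu(x) \left[ \sum_{z \in Z(x)} P_{x,\sigma}(z)\, \pi_i(z) \right].$$

For example, in the game of Figure 10.1, if

$$\sigma = \left(\begin{array}{ccc|cc|cc|cc} a & b & c & f & g & d & e & h & k \\ \tfrac18 & \tfrac38 & \tfrac48 & 1 & 0 & \tfrac34 & \tfrac14 & \tfrac15 & \tfrac45 \end{array}\right) \quad\text{and}\quad \mu = \left(\begin{array}{cc|cc} x & y & w & z \\ \tfrac23 & \tfrac13 & \tfrac12 & \tfrac12 \end{array}\right)$$

then (as we computed above)

$$\pi_3(\{w,z\} \mid \sigma,\mu) = \mu(w)\left(P_{w,\sigma}(z_5)\pi_3(z_5) + P_{w,\sigma}(z_6)\pi_3(z_6)\right) + \mu(z)\left(P_{z,\sigma}(z_7)\pi_3(z_7) + P_{z,\sigma}(z_8)\pi_3(z_8)\right)$$

$$= \tfrac12\left[\tfrac15(3) + \tfrac45(1)\right] + \tfrac12\left[\tfrac15(0) + \tfrac45(1)\right] = \tfrac{11}{10}.$$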
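The definitions of $P_{x,\sigma}$ and $\pi_i(H \mid \sigma,\mu)$ translate directly into a short recursion. The sketch below is only an illustration: the dictionary encoding of the part of Figure 10.1 that follows node t, and the function names, are our own choices, while the probabilities and Player 3’s payoffs are the ones used in the computations above.

```python
from fractions import Fraction as F

# Hypothetical encoding of the part of Figure 10.1 below Player 2's node t:
# each decision node maps every available choice to the node it leads to.
children = {
    "t": {"d": "w", "e": "z"},
    "w": {"h": "z5", "k": "z6"},
    "z": {"h": "z7", "k": "z8"},
}
pi3 = {"z5": 3, "z6": 1, "z7": 0, "z8": 1}      # Player 3's payoffs
sigma = {"d": F(3, 4), "e": F(1, 4), "h": F(1, 5), "k": F(4, 5)}
mu = {"w": F(1, 2), "z": F(1, 2)}               # beliefs at {w, z}

def terminal_distribution(x):
    """P_{x,sigma}: probability of each terminal node in Z(x), starting at x."""
    if x not in children:                        # x is a terminal node
        return {x: F(1)}
    dist = {}
    for choice, successor in children[x].items():
        for z, prob in terminal_distribution(successor).items():
            dist[z] = dist.get(z, F(0)) + sigma[choice] * prob
    return dist

def expected_payoff(info_set, payoff):
    """pi_i(H | sigma, mu): belief-weighted expected payoff starting from H."""
    return sum(
        mu[x] * sum(prob * payoff[z]
                    for z, prob in terminal_distribution(x).items())
        for x in info_set
    )

p_t_z5 = terminal_distribution("t")["z5"]        # sigma(d) * sigma(h)
pi3_at_wz = expected_payoff(["w", "z"], pi3)
```

Exact fractions are used so that the results can be compared with the hand computations: `p_t_z5` corresponds to $P_{t,\sigma}(z_5)$ and `pi3_at_wz` to $\pi_3(\{w,z\} \mid \sigma,\mu)$.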

Recall that if $\sigma$ is a strategy profile and i is a player, then $\sigma_{-i}$ denotes the
profile of strategies of the players other than i and we often use $(\sigma_i,\sigma_{-i})$ as an
alternative way of denoting $\sigma$; furthermore, if $\tau_i$ is a strategy of Player i, we
denote by $(\tau_i,\sigma_{-i})$ the strategy profile obtained from $\sigma$ by replacing $\sigma_i$ with
$\tau_i$ (and leaving everything else unchanged).

Definition 10.2. Fix an extensive-form game and an assessment $(\sigma,\mu)$. We say
that Player i’s behavior strategy $\sigma_i$ is sequentially rational if, for every
information set H of Player i,

$$\pi_i(H \mid (\sigma_i,\sigma_{-i}),\mu) \ge \pi_i(H \mid (\tau_i,\sigma_{-i}),\mu),$$

for every behavior strategy $\tau_i$ of Player i.

We say that $\sigma$ is sequentially rational if, for every Player i, $\sigma_i$ is sequentially
rational.

Note that for Player i’s strategy $\sigma_i$ to be sequentially rational it is not
sufficient (although it is necessary) that at every information set H of Player i
the choice(s) at H prescribed by $\sigma_i$ be optimal (given the choices of the other
players prescribed by $\sigma_{-i}$): we need to check if Player i could improve her
payoff by changing her choice(s) at H and possibly also at information sets of hers
that follow H. To see this, consider the game of Figure 10.2 below.
Let $(\sigma,\mu)$ be the assessment where $\sigma$ is the pure-strategy profile (a,(c,e)) and
$\mu = \left(\begin{array}{cc|cc} s & t & x & y \\ 1 & 0 & 0 & 1 \end{array}\right)$. For Player 2 e is rational at information set {x,y} because,
given her belief that she is making her choice at node y, e gives her a payoff of
1 while f would give her a payoff of 0; furthermore, at information set {s,t}
- given her belief that she is making her choice at node s and given her future
choice of e at {x,y} - choice c is better than choice d because the former gives her
a payoff of 1 while the latter a payoff of 0. However, the strategy (c,e) (while
sequentially rational at {x,y}) is not sequentially rational at {s,t} because - given
her belief that she is making her choice at node s - with (c,e) she gets a payoff of
1 but if she switched to (d,f) she would get a payoff of 2; in other words, Player
2 can improve her payoff by changing her choice at {s,t} from c to d and also
changing her future planned choice at {x,y} from e to f.² Note that the
pure-strategy profile (a,(c,e)) is not a Nash equilibrium: for Player 2 the unique
best reply to a is (d,f).

2 It is possible to impose restrictions on the system of beliefs $\mu$ such that sequential rationality
as defined in Definition 10.2 is equivalent to the weaker condition that, at every information
set H, the corresponding player cannot increase her payoff by changing her choice(s) at H
only. We will not discuss this condition here. The interested reader is referred to Hendon et
al. (1996) and Perea (2002).

An example of a sequentially rational assessment for the game of Figure 10.3
below is

$$\sigma = \left(\begin{array}{ccc|cc|cc} L & M & R & A & B & c & d \\ 0 & 0 & 1 & \tfrac12 & \tfrac12 & 1 & 0 \end{array}\right) \quad\text{and}\quad \mu = \left(\begin{array}{cc|ccc} x & y & u & v & w \\ \tfrac14 & \tfrac34 & \tfrac15 & \tfrac35 & \tfrac15 \end{array}\right).$$

Figure 10.3

Let us first verify sequential rationality of Player 3’s strategy. At her
information set {u,v,w}, given her beliefs, c gives a payoff of
$\tfrac15(5) + \tfrac35(1) + \tfrac15(2) = 2$, while d gives a payoff of $\tfrac15(2) + \tfrac35(2) + \tfrac15(2) = 2$. Thus c is
optimal (as would be d and any randomization over c and d). Now consider
Player 2: at his information set {x,y}, given his beliefs and given the strategy of
Player 3, A gives a payoff of $\tfrac14(5) + \tfrac34(3) = 3.5$ and B gives a payoff of
$\tfrac14(2) + \tfrac34(4) = 3.5$; thus any mixture of A and B is optimal, in particular, the
mixture $\begin{pmatrix} A & B \\ \tfrac12 & \tfrac12 \end{pmatrix}$ is optimal. Finally, at the root, R gives Player 1 a payoff of 3,
L a payoff of $\tfrac12(4) + \tfrac12(1) = 2.5$ and M a payoff of $\tfrac12(0) + \tfrac12(6) = 3$; thus R is
optimal (as would be any mixture over M and R).
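The three verifications above follow the same pattern: weigh each choice’s payoffs by the relevant probabilities and collect the choices that attain the maximum. A minimal sketch of that pattern follows; the helper names are our own, and the beliefs and payoffs are the numbers appearing in the calculations above as we read them, with Player 3’s beliefs over {u,v,w} taken to be $\tfrac15, \tfrac35, \tfrac15$.

```python
from fractions import Fraction as F

def expected(weights, payoffs):
    """Expected payoff of one choice: sum of weight * payoff."""
    return sum(w * p for w, p in zip(weights, payoffs))

def optimal(options):
    """Set of choices whose expected payoff is maximal."""
    top = max(options.values())
    return {choice for choice, value in options.items() if value == top}

mu_uvw = (F(1, 5), F(3, 5), F(1, 5))   # Player 3's beliefs (assumed reading)
mu_xy = (F(1, 4), F(3, 4))             # Player 2's beliefs
mix_AB = (F(1, 2), F(1, 2))            # Player 2's mixture over A and B

player3 = {"c": expected(mu_uvw, (5, 1, 2)), "d": expected(mu_uvw, (2, 2, 2))}
player2 = {"A": expected(mu_xy, (5, 3)), "B": expected(mu_xy, (2, 4))}
player1 = {"L": expected(mix_AB, (4, 1)),
           "M": expected(mix_AB, (0, 6)),
           "R": F(3)}

optimal_for_3 = optimal(player3)
optimal_for_2 = optimal(player2)
optimal_for_1 = optimal(player1)
```

A choice prescribed with positive probability by $\sigma$ passes the check when it belongs to the corresponding optimal set; here that is the case for c, for both A and B, and for R.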
This is a good time to test your understanding of the concepts introduced in
this  section,  by  going  through  the  exercises  in  Section  10.E.1  of  Appendix  10.E  at 
the  end  of  this  chapter. 


10.2  Bayesian  updating  at  reached 
information  sets. 


The second requirement for an assessment to be “reasonable” is that the beliefs
encoded in $\mu$ should be consistent with the behavior postulated by $\sigma$, in the
sense that the rule for belief updating (Definition 8.3, Chapter 8) should be
used to form those beliefs, whenever it is applicable. We shall call this rule
Bayesian updating.


Let x be a node that belongs to information set H and let $P_{root,\sigma}(x)$ be the
probability that node x is reached (from the root of the tree) if $\sigma$ is
implemented.³ Let $P_{root,\sigma}(H) = \sum_{y \in H} P_{root,\sigma}(y)$ be the probability that information
set H is reached (that is, the probability that some node in H is reached). Then
Bayesian updating requires that the probability that is assigned to x given the
information that H has been reached be given by the conditional probability

$$P_{root,\sigma}(x \mid H) = \frac{P_{root,\sigma}(x)}{P_{root,\sigma}(H)};$$

of course, this conditional probability is well defined
if and only if $P_{root,\sigma}(H) > 0$.


Definition 10.3. Consider an extensive-form game and a behavioral strategy
profile $\sigma$. We say that an information set H is reached by $\sigma$ if $P_{root,\sigma}(H) > 0$.


3 That is, if $(a_1,\dots,a_m)$ is the sequence of choices that leads from the root to x then
$P_{root,\sigma}(x) = \sigma(a_1) \times \dots \times \sigma(a_m)$.
For example, in the game of Figure 10.1 partially reproduced below (we
omitted the payoffs), if $\sigma$ is such that Player 1 plays c with probability 0, so
that $\sigma(a) + \sigma(b) = 1$, then {x,y} is
reached by $\sigma$, while {w,z} is not.⁴


Definition 10.4. Given an extensive-form game and an assessment $(\sigma,\mu)$, we
say that $(\sigma,\mu)$ satisfies Bayesian updating at reached information sets if, for
every information set H and every node $x \in H$, if H is reached by $\sigma$ (that is, if
$P_{root,\sigma}(H) > 0$) then

$$\mu(x) = \frac{P_{root,\sigma}(x)}{P_{root,\sigma}(H)}.$$

For  example,  in  the  game  of  Figure  10.1  partially  reproduced  above,  the 

'abcfgldehk]  (x  y w z\ 

assessment  cr  — .1,  , „ . , u=  ' satisfies 

ill  Oil-  - 10  ^ iiii 

V9  9 9 u 1 i 4 4 1 \ 6 6 4 4/ 

Bayesian  updating  at  reached  information  sets.  In  fact,  we  have  that 

^roo,,*  (*)=?»  Pra0,,a  O')  = I > P root,.  ( {x,  T}  ) = ? + f = f , ^ root, a C W)  = I (?)  = > 

P™o;,a(Z)  = l(l)=i  and  Proo^({W^})  = i + ^ = il-  ThuS 


4 $P_{root,\sigma}(\{x,y\}) = P_{root,\sigma}(x) + P_{root,\sigma}(y) = 1$; on the other hand,
$P_{root,\sigma}(w) = \sigma(c)\sigma(d) = 0$ and $P_{root,\sigma}(z) = \sigma(c)\sigma(e) = 0$, so that $P_{root,\sigma}(\{w,z\}) = 0$.
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P frl  1 

~ r°0‘(]  ' n = f = i = Mx)  and>  similarly, 

Koo,,a((X’y})  I 

P (w)  _2_  P 

> =j6=l=  (w)  and  n^A_>_ 

P'OO'JM)  I WM) 


F,00f,g(.V) 


36 

12 

36 


= { = //(z). 


= Mt)  > 


Remark 10.1. Note that the condition “Bayesian updating at reached
information sets” is trivially satisfied at every information set H that is not
reached by $\sigma$, that is, at every information set H such that $P_{root,\sigma}(H) = 0$.


This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this  section,  by  going  through  the  exercises  in  Section  10.E.2  of  Appendix  10.E  at 
the  end  of  this  chapter. 


10.3  Weak  sequential  equilibrium 

Our  objective  is  to  find  a refinement  of  subgame-perfect  equilibrium  that  rules 
out  “unreasonable”  equilibria.  Moving  from  strategy  profiles  to  assessments 
allowed  us  to  judge  the  rationality  of  a choice  at  an  information  set 
independently  of  whether  that  information  set  is  reached  by  the  strategy  profile 
under  consideration:  the  requirement  of  sequential  rationality  rules  out  choices 
that  are  strictly  dominated  at  an  information  set.  As  a first  attempt  we  define  a 
notion  of  equilibrium  based  on  the  two  requirements  of  sequential  rationality 
and  Bayesian  updating  at  reached  information  sets. 

Definition 10.5. A weak sequential equilibrium is an assessment $(\sigma,\mu)$ which
satisfies two requirements: (1) sequential rationality and (2) Bayesian updating
at reached information sets.


5 In logic a proposition of the form “if A then B” (A is called the antecedent and B is called the
consequent) is false only when A is true and B is false. Thus, in particular, the proposition “if
A then B” is true whenever A is false (whatever the truth value of B). In our case the
antecedent is “$P_{root,\sigma}(H) > 0$”.
Before we discuss the properties of weak sequential equilibria we illustrate the
type  of  reasoning  that  one  needs  to  go  through  in  order  to  find  weak  sequential 
equilibria.  Consider  first  the  game  of  Figure  10.4. 


Figure 10.4


Let us see if there is a weak sequential equilibrium $(\sigma,\mu)$ where Player 1’s
strategy in $\sigma$ is a pure strategy. The set of pure strategies of Player 1 is

$S_1 = \{(A,C), (B,D), (B,C), (A,D)\}$.

The strategy of Player 1 determines the beliefs of Player 2 at her information
set {x,y}. Let us consider the four possibilities.


• If Player 1’s strategy is (A,C), then Player 2’s information set {x,y} is
reached with positive probability and the only beliefs that are consistent
with Bayesian updating are $\begin{pmatrix} x & y \\ 0 & 1 \end{pmatrix}$, so that - by sequential rationality -
Player 2 must choose E. However, if Player 2’s strategy is E then at node s
it is not sequentially rational for Player 1 to choose A. Thus there is no
weak sequential equilibrium where Player 1’s strategy is (A,C).
• If Player 1’s strategy is (B,D), then Player 2’s information set {x,y} is
reached with positive probability and the only beliefs that are consistent
with Bayesian updating are $\begin{pmatrix} x & y \\ 1 & 0 \end{pmatrix}$, so that - by sequential rationality -
Player 2 must choose F. However, if Player 2’s strategy is F then at node s it
is not sequentially rational for Player 1 to choose B. Thus there is no weak
sequential equilibrium where Player 1’s strategy is (B,D).

• If Player 1’s strategy is (B,C), then Player 2’s information set {x,y} is
reached (with probability 1) and the only beliefs that are consistent with
Bayesian updating are $\begin{pmatrix} x & y \\ \tfrac14 & \tfrac34 \end{pmatrix}$. Given these beliefs, Player 2’s payoff from
playing E is $\tfrac14(0) + \tfrac34(1) = \tfrac34$ and her payoff from playing F is
$\tfrac14(3) + \tfrac34(0) = \tfrac34$. Thus any mixed strategy is sequentially rational for Player
2, that is, for any $p \in [0,1]$, $\begin{pmatrix} E & F \\ p & 1-p \end{pmatrix}$ is sequentially rational. At node s
choice B is sequentially rational for Player 1 if and only if the expected
payoff from playing B is at least 1 (which is the payoff from playing A):
$2p + 0(1-p) \ge 1$, that is, $p \ge \tfrac12$. At node t choice C is sequentially rational
for Player 1 if and only if the expected payoff from playing C is at least 2
(which is the payoff from playing D): $3p + 0(1-p) \ge 2$, that is, $p \ge \tfrac23$.
Hence, if $p \ge \tfrac23$ then both B and C are sequentially rational. Thus we have
an infinite number of weak sequential equilibria at which Player 1’s strategy
is (B,C): for every $p \in [\tfrac23, 1]$, $(\sigma,\mu)$ is a weak sequential equilibrium,
where

$$\sigma = \left(\begin{array}{cc|cc|cc} A & B & C & D & E & F \\ 0 & 1 & 1 & 0 & p & 1-p \end{array}\right), \quad \mu = \begin{pmatrix} x & y \\ \tfrac14 & \tfrac34 \end{pmatrix}.$$
• If Player 1’s strategy is (A,D), then Player 2’s information set {x,y} is not
reached and thus, according to the notion of weak sequential equilibrium,
any beliefs are allowed there. Let $\begin{pmatrix} E & F \\ p & 1-p \end{pmatrix}$ be Player 2’s strategy. From
previous calculations we have that both A and D are sequentially rational if
and only if $p \le \tfrac12$. One possibility is to set p = 0; this means that Player 2
chooses the pure strategy F and this is sequentially rational if and only if her
beliefs are $\begin{pmatrix} x & y \\ q & 1-q \end{pmatrix}$ with $3q + 0(1-q) \ge 0q + 1(1-q)$ (the left-hand side is the
payoff from F, the right-hand side the payoff from E), that is, $q \ge \tfrac14$. Thus
we have an infinite number of weak sequential equilibria: $(\sigma,\mu)$ is a weak
sequential equilibrium, where

$$\sigma = \left(\begin{array}{cc|cc|cc} A & B & C & D & E & F \\ 1 & 0 & 0 & 1 & 0 & 1 \end{array}\right), \quad \mu = \begin{pmatrix} x & y \\ q & 1-q \end{pmatrix}$$

for every $q \in [\tfrac14, 1]$.

Next consider the case where $0 < p \le \tfrac12$, so that Player 2 uses a mixed
strategy. This can be sequentially rational if and only if the payoff from E is
equal to the payoff from F, that is (using the previous calculations) if and
only if her beliefs are $\begin{pmatrix} x & y \\ \tfrac14 & \tfrac34 \end{pmatrix}$. Thus we have an infinite number of weak
sequential equilibria: $(\sigma,\mu)$ is a weak sequential equilibrium, where

$$\sigma = \left(\begin{array}{cc|cc|cc} A & B & C & D & E & F \\ 1 & 0 & 0 & 1 & p & 1-p \end{array}\right), \quad \mu = \begin{pmatrix} x & y \\ \tfrac14 & \tfrac34 \end{pmatrix}$$

for every $p \in (0, \tfrac12]$.
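The case analysis for Figure 10.4 rests on two comparisons for Player 1 (1 versus $2p$ at node s, and $3p$ versus 2 at node t) and one for Player 2 ($1-q$ from E versus $3q$ from F). The sketch below encodes just these comparisons; the function names are our own, p denotes the probability with which Player 2 plays E and q the belief assigned to node x. It checks only optimality of the choices, not the Bayesian-updating restriction on q.

```python
from fractions import Fraction as F

def supports(strategy, p):
    """True if Player 1's pure strategy (choice at s, choice at t) is
    sequentially rational when Player 2 plays E with probability p."""
    at_s, at_t = strategy
    payoff_s = {"A": F(1), "B": 2 * p}      # B yields 2p + 0(1 - p)
    payoff_t = {"C": 3 * p, "D": F(2)}      # C yields 3p + 0(1 - p)
    return (payoff_s[at_s] == max(payoff_s.values())
            and payoff_t[at_t] == max(payoff_t.values()))

def player2_best(q):
    """Player 2's optimal pure choices when she assigns probability q to x."""
    payoff = {"E": 1 - q, "F": 3 * q}
    top = max(payoff.values())
    return {choice for choice, value in payoff.items() if value == top}

grid = [F(k, 100) for k in range(101)]
p_for_BC = [p for p in grid if supports(("B", "C"), p)]
p_for_AD = [p for p in grid if supports(("A", "D"), p)]
p_for_AC = [p for p in grid if supports(("A", "C"), p)]
```

On this grid `p_for_BC` starts at $\tfrac23$, `p_for_AD` ends at $\tfrac12$ and `p_for_AC` is empty, in line with the thresholds derived above; `player2_best(F(1, 4))` returns both E and F, which is why mixing by Player 2 requires the beliefs $\tfrac14, \tfrac34$.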

Next  we  consider  the  more  complex  game  shown  in  Figure  10.5. 
Figure 10.5


For  simplicity,  let  us  limit  the  search  to  pure-strategy  weak  sequential 
equilibria.  How  should  we  proceed?  The  first  thing  to  do  is  to  see  if  the  game 
itself  can  be  simplified;  in  particular,  one  should  check  if  there  are  any 
information  sets  where  there  is  a strictly  dominant  choice:  if  there  is  such  a 
choice  then,  no  matter  what  beliefs  the  relevant  player  has  at  that  information 
set,  sequential  rationality  requires  that  choice  to  be  selected.  In  the  game  of 
Figure  10.5  there  is  indeed  such  an  information  set,  namely  information  set 
{w,  z}  of  Player  2:  here  L is  strictly  better  than  R for  Player  2 at  both  nodes, 
that  is,  R is  strictly  dominated  by  L as  a choice.  Thus  we  can  simplify  the  game 
by  (1)  removing  this  information  set,  (2)  converting  nodes  w and  z into 
terminal  nodes  and  (3)  assigning  to  these  newly  created  terminal  nodes  the 
payoffs  associated  with  choice  L.  The  simplified  game  is  shown  in  Figure  10.6 
below. 
Figure 10.6

Applying  the  same  reasoning  to  the  simplified  game  of  Figure  10.6,  we  can 
delete  the  two  singleton  decision  nodes  of  Player  3 and  replace  one  with  the 
payoff  associated  with  choice  F and  the  other  with  the  payoff  associated  with 
choice  H,  thus  obtaining  the  further  simplification  shown  in  Figure  10.7  below. 
In  this  game  there  are  no  more  information  sets  with  strictly  dominant  choices. 
Thus  we  have  to  proceed  by  trial  and  error.  Note  first  that,  at  any  weak 
sequential equilibrium, by Bayesian updating Player 1’s beliefs must be
$\begin{pmatrix} r & s & t \\ \tfrac15 & \tfrac15 & \tfrac35 \end{pmatrix}$.

Figure 10.7


Let us see if in the simplified game of Figure 10.7 there is a pure-strategy weak
sequential equilibrium where Player 1’s strategy is “play”. When Player 1
chooses “play” then, by Bayesian updating, Player 2’s beliefs must be
$\begin{pmatrix} x & y \\ \tfrac14 & \tfrac34 \end{pmatrix}$.

Let us first try the hypothesis that there is a weak sequential equilibrium
where (Player 1’s strategy is “play” and) Player 2’s strategy is e. Then, by
Bayesian updating, Player 3’s beliefs must be $\begin{pmatrix} u & v \\ 1 & 0 \end{pmatrix}$, making A the only
sequentially rational choice at information set {u,v}. However, if Player 3’s
strategy is A then Player 2, at her information set {x,y}, gets a payoff of
$\tfrac14(10) + \tfrac34(2) = 4$ from playing d and a payoff of $\tfrac14(3) + \tfrac34(4) = \tfrac{15}{4}$ from
playing e. Thus e is not sequentially rational and we have reached a
contradiction.

Let us now try the hypothesis that there is a weak sequential equilibrium where (Player 1’s strategy is “play” and) Player 2’s strategy is d. Then, by Bayesian updating, Player 3’s beliefs must be $\begin{pmatrix} u & v \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}$; hence Player 3’s expected payoff from playing A is $\frac{1}{2}(2) + \frac{1}{2}(4) = 3$ and his expected payoff from playing B is $\frac{1}{2}(0) + \frac{1}{2}(5) = 2.5$. Thus A is the only sequentially rational choice at information set {u,v}. Hence, by the previous calculations for Player 2, d is indeed sequentially rational for Player 2. Thus it only remains to check if Player 1 indeed wants to choose “play”. Given the strategies d and A of Players 2 and 3, respectively, Player 1 gets a payoff of 2 from “pass” and a payoff of $\frac{1}{5}(4) + \frac{1}{5}(4) + \frac{3}{5}(1) = \frac{11}{5} > 2$ from “play”. Thus “play” is indeed sequentially rational. Thus we have found a pure-strategy weak sequential equilibrium of the game of Figure 10.7, namely


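$$\sigma = \left(\begin{array}{cc|cc|cc} \text{pass} & \text{play} & d & e & A & B \\ 0 & 1 & 1 & 0 & 1 & 0 \end{array}\right), \qquad \mu = \left(\begin{array}{ccc|cc|cc} r & s & t & x & y & u & v \\ \frac{1}{5} & \frac{1}{5} & \frac{3}{5} & \frac{1}{4} & \frac{3}{4} & \frac{1}{2} & \frac{1}{2} \end{array}\right)$$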
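
The three comparisons used in the argument above can be re-done mechanically. The following is only a sketch: the payoff numbers are the ones appearing in the expected-payoff sums of the text (they are not read off the game tree, which is not reproduced here), and the helper name `expected` is hypothetical.

```python
from fractions import Fraction as F

def expected(beliefs, payoffs):
    """Belief-weighted payoff over the nodes of one information set."""
    return sum(b * p for b, p in zip(beliefs, payoffs))

# Player 2 at {x, y}: beliefs (1/4, 3/4), with Player 3 playing A.
mu_xy = (F(1, 4), F(3, 4))
payoff_d = expected(mu_xy, (10, 2))
payoff_e = expected(mu_xy, (3, 4))

# Player 3 at {u, v}: beliefs (1/2, 1/2), induced by Player 2 playing d.
mu_uv = (F(1, 2), F(1, 2))
payoff_A = expected(mu_uv, (2, 4))
payoff_B = expected(mu_uv, (0, 5))

# Player 1 at {r, s, t}: beliefs (1/5, 1/5, 3/5), given d and A.
mu_rst = (F(1, 5), F(1, 5), F(3, 5))
payoff_play = expected(mu_rst, (4, 4, 1))
payoff_pass = F(2)

# Each chosen action must do strictly better than the alternative.
all_checks_pass = (payoff_d > payoff_e
                   and payoff_A > payoff_B
                   and payoff_play > payoff_pass)
print(payoff_d, payoff_e, payoff_A, payoff_B, payoff_play, all_checks_pass)
```

Exact fractions are used so that comparisons such as 11/5 > 2 are not affected by floating-point rounding.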

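This equilibrium can be extended to a weak sequential equilibrium of the original game of Figure 10.5 by adding the choices that led to the simplified game of Figure 10.7 and arbitrary beliefs at information set {w,z}:

$$\sigma = \left(\begin{array}{cc|cc|cc|cc|cc|cc} \text{pass} & \text{play} & d & e & A & B & F & G & H & K & L & R \\ 0 & 1 & 1 & 0 & 1 & 0 & 1 & 0 & 1 & 0 & 1 & 0 \end{array}\right),$$

$$\mu = \left(\begin{array}{ccc|cc|cc|cc} r & s & t & x & y & u & v & w & z \\ \frac{1}{5} & \frac{1}{5} & \frac{3}{5} & \frac{1}{4} & \frac{3}{4} & \frac{1}{2} & \frac{1}{2} & p & 1-p \end{array}\right) \quad \text{for any } p \in [0,1].$$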


Are  there  any  other  pure-strategy  weak  sequential  equilibria?  This  question  is 
addressed  in  Exercise  10.7. 


We  now  explore  the  relationship  between  the  notion  of  weak  sequential 
equilibrium  and  other  equilibrium  concepts. 


Theorem 10.1. Given an extensive-form game with cardinal payoffs G, if $(\sigma,\mu)$ is a weak sequential equilibrium of G then $\sigma$ is a Nash equilibrium of G.

In general, not every Nash equilibrium can be part of a weak sequential equilibrium. To see this, consider the extensive-form game of Figure 10.8 that reproduces the game of Figure 6.12 (Chapter 6). The pure-strategy profile (b,f,d)
is  a Nash  equilibrium,  but  it  cannot  be  part  of  any  weak  sequential  equilibrium, 
because  - no  matter  what  beliefs  Player  3 has  at  her  information  set  - choice  d 
is  not  sequentially  rational  (it  is  strictly  dominated  by  c at  both  nodes  in  that 
information  set). 


Figure 10.8 (game tree not reproduced)


Thus weak sequential equilibrium is a strict refinement of Nash equilibrium. Does it also refine the notion of subgame-perfect equilibrium? Unfortunately, the answer is negative: it is possible for $(\sigma,\mu)$ to be a weak sequential equilibrium without $\sigma$ being a subgame-perfect equilibrium. To see this, consider the game of Figure 10.9, which reproduces Figure 10.1.

Figure 10.9 (game tree not reproduced)

The assessment $\sigma = (b,(f,e),h)$ with $\mu = \left(\begin{array}{cc|cc} x & y & w & z \\ 0 & 1 & 1 & 0 \end{array}\right)$ is a weak sequential equilibrium (the reader should verify this; in particular, note that information set {w,z} is not reached and thus Bayesian updating allows for arbitrary beliefs at that information set). However, (b,(f,e),h) is not a subgame-perfect equilibrium, because the restriction of this strategy profile to the proper subgame that starts at node t of Player 2, namely (e,h), is not a Nash equilibrium of that subgame: h is not a best reply to e.


The  relationship  between  the  three  notions  of  Nash  equilibrium,  subgame- 
perfect  equilibrium  and  weak  sequential  equilibrium  is  illustrated  in  the  Venn 
diagram  of  Figure  10.10. 

Figure 10.10 (Venn diagram not reproduced; its regions are labeled “Nash equilibria”, “Subgame-perfect equilibria” and “Weak sequential equilibria”)


• Every  subgame-perfect  equilibrium  is  a Nash  equilibrium.  However,  there 
are  games  in  which  there  is  a subgame-perfect  equilibrium  that  is  not  part  of 
any  weak  sequential  equilibrium.  For  example,  in  the  game  of  Figure  10.8, 
(b,f,d)  is  a Nash  equilibrium  which  is  also  subgame-perfect  (because  there 
are  no  proper  subgames);  however,  choice  d is  strictly  dominated  and  thus  is 
not  sequentially  rational  for  Player  3,  no  matter  what  beliefs  he  has  at  his 
information  set.  Thus  ( b,f,d ) cannot  be  part  of  a weak  sequential 
equilibrium. 

• Every  weak  sequential  equilibrium  is  a Nash  equilibrium.  However,  as 
shown  in  the  example  of  Figure  10.9,  there  are  games  in  which  there  is  a 
weak  sequential  equilibrium  whose  strategy  is  not  a subgame-perfect 
equilibrium. 

As we will see in Chapter 11, every sequential equilibrium (Definition 11.2) is a
weak  sequential  equilibrium  (Remark  11.1)  and  every  finite  extensive-form 
game  with  cardinal  payoffs  has  at  least  one  sequential  equilibrium  (Theorem 
11.2);  hence  the  following  theorem  is  a corollary  of  Remark  11.1  and  Theorem 
11.2. 


Theorem  10.2.  Every  finite  extensive-form  game  with  cardinal  payoffs  has  at 
least  one  weak  sequential  equilibrium  (possibly  in  mixed  strategies). 


This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this  section,  by  going  through  the  exercises  in  Section  10.E.3  of  Appendix  10.E  at 
the  end  of  this  chapter. 

Appendix 10.E: Exercises

10.E.1.  Exercises  for  Section  10.1: 

Assessments  and  sequential  rationality 

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 


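Exercise 10.1. For the game of Figure 10.1, reproduced below, check whether the assessment

$$\sigma = \left(\begin{array}{ccc|cc|cc|cc} a & b & c & h & k & d & e & f & g \\ \frac{1}{8} & \frac{3}{8} & \frac{4}{8} & 1 & 0 & 0 & 1 & 1 & 0 \end{array}\right), \qquad \mu = \left(\begin{array}{cc|cc} x & y & w & z \\ \frac{1}{3} & \frac{2}{3} & \frac{1}{2} & \frac{1}{2} \end{array}\right)$$

is sequentially rational.

(Figure 10.1: game tree not reproduced)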
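
Exercise 10.2. Consider the following modification of the game of Figure 10.1 obtained by replacing Player 3 with Player 1 at information set {w,z} (note that in the game of Figure 10.1 the payoffs of Players 1 and 3 are identical):

(game tree not reproduced)

Is the assessment

$$\sigma = \left(\begin{array}{ccc|cc|cc|cc} a & b & c & h & k & d & e & f & g \\ \frac{1}{8} & \frac{3}{8} & \frac{4}{8} & 1 & 0 & 0 & 1 & 1 & 0 \end{array}\right), \qquad \mu = \left(\begin{array}{cc|cc} x & y & w & z \\ \frac{1}{3} & \frac{2}{3} & \frac{1}{2} & \frac{1}{2} \end{array}\right)$$

sequentially rational?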


Exercise  10.3.  Consider  the  following  game: 

1 


M = 


t x y ' 

ill 
2 4 4 J 


sequentially  rational? 

10.E.2. Exercises for Section 10.2:

Bayesian  updating  at  reached  information  sets. 

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 


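Exercise 10.4. For the game of Figure 10.1 find a system of beliefs $\mu$ such that $(\sigma,\mu)$ satisfies Bayesian updating at reached information sets (Definition 10.4), where

$$\sigma = \left(\begin{array}{ccc|cc|cc|cc} a & b & c & f & g & d & e & h & k \\ \frac{1}{8} & \frac{3}{8} & \frac{4}{8} & 1 & 0 & \frac{3}{4} & \frac{1}{4} & \frac{1}{5} & \frac{4}{5} \end{array}\right).$$
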

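Exercise 10.5. In the following game, let

$$\sigma = \left(\begin{array}{ccc|cc|cc} a & b & r & c & d & e & f \\ \frac{2}{10} & \frac{1}{10} & \frac{7}{10} & 1 & 0 & \frac{1}{3} & \frac{2}{3} \end{array}\right).$$

(game tree not reproduced)

Find all the systems of beliefs which, combined with $\sigma$, yield assessments that satisfy Bayesian updating at reached information sets.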

10.E.3. Exercises for Section 10.3:

Weak  Sequential  Equilibrium. 

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 

Exercise  10.6.  Find  all  the  pure-strategy  weak  sequential  equilibria  of  the 
following  game: 

(game tree not reproduced; the game begins with a move by Nature)


Exercise  10.7.  In  the  game  of  Figure  10.7,  reproduced  below,  is  there  a pure- 
strategy  weak  sequential  equilibrium  where  Player  1 chooses  “pass”? 


(game tree not reproduced; the game begins with a move by Nature)

Exercise 10.8 (Challenging Question).

Player 1 can take action C or L and Player 2 can take action c or f. The von Neumann-Morgenstern payoffs are as follows:

                       Player 2
                       c        f
    Player 1     C    4,4      6,3
                 L    3,1      5,2

The game, however, is more complex than the above strategic form. Player 1 moves first and chooses between C and L. He then sends an e-mail to Player 2 telling her truthfully what choice he made. However, it is commonly known between the two that a hacker likes to intercept e-mails and change the text. The hacker is a computer program that, with probability $(1-\varepsilon)$, leaves the text unchanged and, with probability $\varepsilon$, changes the sentence “I chose C” into the sentence “I chose L” and the sentence “I chose L” into the sentence “I chose C”. This is commonly known. The value of $\varepsilon$ is also commonly known. Assume that $\varepsilon \in (0,\frac{1}{4})$.

(a) Draw the extensive-form game.

(b) Find all the pure-strategy weak sequential equilibria.

(c) Are there any weak sequential equilibria (pure or mixed) in which Player 2, when she receives a message from Player 1 saying “I chose L”, plays f with probability 1?

Appendix 10.S: Solutions to exercises

Exercise 10.1. The assessment

$$\sigma = \left(\begin{array}{ccc|cc|cc|cc} a & b & c & h & k & d & e & f & g \\ \frac{1}{8} & \frac{3}{8} & \frac{4}{8} & 1 & 0 & 0 & 1 & 1 & 0 \end{array}\right), \qquad \mu = \left(\begin{array}{cc|cc} x & y & w & z \\ \frac{1}{3} & \frac{2}{3} & \frac{1}{2} & \frac{1}{2} \end{array}\right)$$

is sequentially rational. In fact,

• at the root, a gives Player 1 a payoff of 0 and so do b and c (given the strategies of Players 2 and 3). Thus any mixture of a, b and c is sequentially rational; in particular, $\begin{pmatrix} a & b & c \\ \frac{1}{8} & \frac{3}{8} & \frac{4}{8} \end{pmatrix}$;

• at Player 2’s node t, given Player 3’s strategy, d gives Player 2 a payoff of 1 while e gives a payoff of 2; thus e is sequentially rational;

• given $\mu$, at information set {x,y}, f gives Player 2 a payoff of $\frac{1}{3}(3) + \frac{2}{3}(3) = 3$ while g gives $\frac{1}{3}(6) + \frac{2}{3}(0) = 2$; thus f is sequentially rational;

• given $\mu$, at information set {w,z}, h gives Player 3 a payoff of $\frac{1}{2}(3) + \frac{1}{2}(0) = 1.5$, while k gives 1; thus h is sequentially rational.


Exercise 10.2. It might seem that the answer is the same as in the previous exercise, because the calculations at the various information sets remain the same; however, in this game checking sequential rationality at the root involves checking whether, given the strategy of Player 2 (namely (f,e)), Player 1 can increase his payoff by changing his entire strategy, that is by changing his choices at both the root and at information set {w,z}. Indeed, if Player 1 changes his strategy from

$$\left(\begin{array}{ccc|cc} a & b & c & h & k \\ \frac{1}{8} & \frac{3}{8} & \frac{4}{8} & 1 & 0 \end{array}\right) \quad \text{to} \quad \left(\begin{array}{ccc|cc} a & b & c & h & k \\ 0 & 0 & 1 & 0 & 1 \end{array}\right)$$

his payoff increases from 0 to 1. Hence $\left(\begin{array}{ccc|cc} a & b & c & h & k \\ \frac{1}{8} & \frac{3}{8} & \frac{4}{8} & 1 & 0 \end{array}\right)$ is not sequentially rational at the root.

Exercise 10.3. The game under consideration is the following:


2 0 0 2 

1 0 0 2.5 

0 4 12  0 


The  assessment  a = 


a b r c d e f 

0 0 1---- 

u u 1 5 5 3 3 


s t x y 

1131 


Till 
2 2 4 4 


sequentially  rational,  as  the  following  calculations  show. 


is 


• At  Player  3’s  information  set  {x,y},  e gives  Player  3 a payoff  of 

i(0)  + i (12)  = 3 and / a payoff  of  -|(4)  + j(0)  = 3;  thus  both  e and  / are 

optimal  and  so  is  any  mixture  of  e and/;  in  particular,  the  mixture 

• At  Player  2’s  information  set  {s,f},  c gives  Player  2 a payoff  of 

y(2)  + 4(0)  = 1 and  (given  the  strategy  of  Player  3)  d a payoff  of 

-j[y(l)  + f (0)]+  -j[}(0)  + f(f)]  = 1 ; thus  both  c and  d are  optimal  and  so  is 

any  mixture  of  c and  d;  in  particular  the  mixture 

• At  the  root,  r gives  Player  1 a payoff  of  2 and  (given  the  strategies  of  Players 
2 and  3)  a gives  a payoff  of  f (1) + ■§■[}  (2) + j (0)]  = j and  b a payoff  of 

j(0)  + f[j(0)  + j(2)]  = j . Thus  r is  sequentially  rational. 

Exercise 10.5. The game under consideration is

(game tree not reproduced)

Let $\sigma = \left(\begin{array}{ccc|cc|cc} a & b & r & c & d & e & f \\ \frac{2}{10} & \frac{1}{10} & \frac{7}{10} & 1 & 0 & \frac{1}{3} & \frac{2}{3} \end{array}\right)$. Since only information set {s,t} is reached by $\sigma$, no restrictions are imposed on the beliefs at information set {x,y}. Thus, for every p such that $0 \le p \le 1$, the system of beliefs $\mu = \left(\begin{array}{cc|cc} s & t & x & y \\ \frac{2}{3} & \frac{1}{3} & p & 1-p \end{array}\right)$ combined with $\sigma$ yields an assessment that satisfies Bayesian updating at reached information sets.


Exercise 10.6. The game under consideration is

(game tree not reproduced; the game begins with a move by Nature)

We restrict attention to pure strategies. First of all, note that - for Player 1 - A is sequentially rational no matter what strategy Player 2 chooses and, similarly, C is sequentially rational no matter what strategy Player 2 chooses. The strategy of Player 1 determines the beliefs of Player 2 at her information set {x,y}. Let us consider the four possibilities (recall that $S_1 = \{(A,C),(B,D),(B,C),(A,D)\}$).

• If Player 1’s strategy is (A,C), then Player 2’s information set {x,y} is reached with positive probability and the only beliefs that are consistent with Bayesian updating are $\begin{pmatrix} x & y \\ 0 & 1 \end{pmatrix}$, so that both E and F are sequentially rational for Player 2. By our preliminary observation it follows that ((A,C),E) with beliefs $\begin{pmatrix} x & y \\ 0 & 1 \end{pmatrix}$ is a weak sequential equilibrium and so is ((A,C),F) with beliefs $\begin{pmatrix} x & y \\ 0 & 1 \end{pmatrix}$.

• If Player 1’s strategy is (B,D), then Player 2’s information set {x,y} is reached with positive probability and the only beliefs that are consistent with Bayesian updating are $\begin{pmatrix} x & y \\ 1 & 0 \end{pmatrix}$, so that - by sequential rationality - Player 2 must choose E. However, if Player 2’s strategy is E then at node t it is not sequentially rational for Player 1 to choose D. Thus there is no pure-strategy weak sequential equilibrium where Player 1’s strategy is (B,D).

• If Player 1’s strategy is (B,C), then Player 2’s information set {x,y} is reached (with probability 1) and the only beliefs that are consistent with Bayesian updating are $\begin{pmatrix} x & y \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}$. Given these beliefs, E is the only sequentially rational choice for Player 2 (her payoff from playing E is $\frac{1}{2}(2) + \frac{1}{2}(0) = 1$, while her payoff from playing F is 0). Thus ((B,C),E) with beliefs $\begin{pmatrix} x & y \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}$ is a weak sequential equilibrium.

• If Player 1’s strategy is (A,D), then Player 2’s information set {x,y} is not reached and thus, according to the notion of weak sequential equilibrium, any beliefs are allowed there. In order for D to be sequentially rational for Player 1 it must be that Player 2’s pure strategy is F. In order for F to be sequentially rational for Player 2, her beliefs must be $\begin{pmatrix} x & y \\ 0 & 1 \end{pmatrix}$. Thus ((A,D),F) with beliefs $\begin{pmatrix} x & y \\ 0 & 1 \end{pmatrix}$ is a weak sequential equilibrium.

Summarizing, there are four pure-strategy weak sequential equilibria:

(1) ((A,C),E) with beliefs $\begin{pmatrix} x & y \\ 0 & 1 \end{pmatrix}$, (2) ((A,C),F) with beliefs $\begin{pmatrix} x & y \\ 0 & 1 \end{pmatrix}$, (3) ((B,C),E) with beliefs $\begin{pmatrix} x & y \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}$ and (4) ((A,D),F) with beliefs $\begin{pmatrix} x & y \\ 0 & 1 \end{pmatrix}$.

Exercise 10.7. The game under consideration is

(game tree not reproduced; the game begins with a move by Nature)

When Player 1’s strategy is “pass” then it is much easier to construct a weak sequential equilibrium, because there are no restrictions on the beliefs at the information sets of Players 2 and 3. For example, we can choose beliefs $\begin{pmatrix} x & y \\ 0 & 1 \end{pmatrix}$ for Player 2, which make e the only sequentially rational choice, and beliefs $\begin{pmatrix} u & v \\ 0 & 1 \end{pmatrix}$ for Player 3, which make B the only sequentially rational choice. It only remains to check sequential rationality of “pass”: if Player 1 chooses “pass” he gets a payoff of 2, while if he chooses “play” he gets a payoff of $\frac{1}{5}(0) + \frac{1}{5}(2) + \frac{3}{5}(0) = \frac{2}{5} < 2$, so that “pass” is indeed the better choice. Thus we have found the following weak sequential equilibrium (note, however, that there are several weak sequential equilibria):

$$\sigma = \left(\begin{array}{cc|cc|cc} \text{pass} & \text{play} & d & e & A & B \\ 1 & 0 & 0 & 1 & 0 & 1 \end{array}\right), \qquad \mu = \left(\begin{array}{ccc|cc|cc} r & s & t & x & y & u & v \\ \frac{1}{5} & \frac{1}{5} & \frac{3}{5} & 0 & 1 & 0 & 1 \end{array}\right)$$


Exercise 10.8. [Challenging question].
(a) The extensive form is as follows:

(game tree not reproduced)

(b) Let us consider the pure strategies of Player 1. If Player 1 plays L then Player 2 assigns probability 1 to the bottom node of each information set and responds with f with probability 1, but this makes Player 1 want to deviate to C to get a payoff of 6.⁶ Thus there is no pure-strategy weak sequential equilibrium where Player 1 plays L. On the other hand, if Player 1 plays C, then Player 2 assigns probability 1 to the top node of each information set and thus sequential rationality requires her to respond with c at each information set, which makes playing C optimal for Player 1. Thus (C,(c,c)), with beliefs that assign probability one to the top node of each information set, is the only pure-strategy weak sequential equilibrium.

⁶ In other words, if Player 1 plays L, then Bayesian updating requires Player 2 to assign probability 1 to the bottom node of each information set and then sequential rationality requires Player 2 to play f at each information set, so that (L,(f,f)) is the only candidate for a weak sequential equilibrium where Player 1 plays L. But L is not a best reply to (f,f) and thus (L,(f,f)) is not a Nash equilibrium and hence cannot be part of a weak sequential equilibrium.

(c) Suppose that Player 2 plays f after reading the message “I chose L”, that is, at her information set on the right. We know from the argument in part (b) that there are no equilibria of this kind in which Player 1 chooses a pure strategy, so Player 1 must be playing a mixed strategy. For him to be willing to do so, he must receive the same payoff from playing C or L. If we let p be the probability with which Player 2 plays c if she receives the message “I chose C”, then Player 1 is indifferent when (the symbol $\iff$ stands for ‘if and only if’)

$$\underbrace{(1-\varepsilon)\left[4p + 6(1-p)\right] + \varepsilon\, 6}_{=\pi_1(C)} = \underbrace{\varepsilon\left[3p + 5(1-p)\right] + (1-\varepsilon)\, 5}_{=\pi_1(L)} \iff p = \frac{1}{2-4\varepsilon}.$$

Since $\varepsilon \in (0,\frac{1}{4})$, $p \in (\frac{1}{2},1)$, so Player 2 randomizes after reading “I chose C”. For Player 2 to be willing to do this, she must be indifferent between c and f in this event. Let $q \in (0,1)$ be the probability with which Player 1 plays C (so that $1-q$ is the probability of L); then Bayesian updating requires that Player 2 assign probability $\frac{(1-\varepsilon)q}{(1-\varepsilon)q + \varepsilon(1-q)}$ to the top node of her information set on the left and probability $\frac{\varepsilon(1-q)}{(1-\varepsilon)q + \varepsilon(1-q)}$ to the bottom node. Then, for Player 2, the expected payoff from playing c at the information set on the left is

$$\pi_2(c) = \frac{(1-\varepsilon)q}{(1-\varepsilon)q + \varepsilon(1-q)}\, 4 + \frac{\varepsilon(1-q)}{(1-\varepsilon)q + \varepsilon(1-q)}\, 1$$

and the expected payoff from playing f is

$$\pi_2(f) = \frac{(1-\varepsilon)q}{(1-\varepsilon)q + \varepsilon(1-q)}\, 3 + \frac{\varepsilon(1-q)}{(1-\varepsilon)q + \varepsilon(1-q)}\, 2.$$

Player 2 is indifferent if these two are equal, that is if $4q(1-\varepsilon) + (1-q)\varepsilon = 3q(1-\varepsilon) + 2(1-q)\varepsilon$, which is true if and only if $q = \varepsilon$.

We have now specified behavior at all information sets. To ensure that the specified behavior is an equilibrium, we need to check that f is optimal for Player 2 if she receives the message “I chose L”. This will be true if

$$\pi_2(c \mid \text{“I chose L”}) \le \pi_2(f \mid \text{“I chose L”}) \iff 4q\varepsilon + 1(1-q)(1-\varepsilon) \le 3q\varepsilon + 2(1-q)(1-\varepsilon)$$

$$\underset{\text{since } \varepsilon = q}{\iff} 4\varepsilon^2 + (1-\varepsilon)^2 \le 3\varepsilon^2 + 2(1-\varepsilon)^2 \iff \varepsilon \le \tfrac{1}{2}.$$

Since we have assumed that $\varepsilon < \frac{1}{4}$, Player 2 strictly prefers to play f after receiving the message “I chose L”. Thus we have constructed the following weak sequential equilibrium:

Behavior strategy of Player 1: $\begin{pmatrix} C & L \\ \varepsilon & 1-\varepsilon \end{pmatrix}$.

Behavior strategy of Player 2: at the information set on the left (where she receives the message “I chose C”): $\begin{pmatrix} c & f \\ \frac{1}{2-4\varepsilon} & \frac{1-4\varepsilon}{2-4\varepsilon} \end{pmatrix}$; at the information set on the right (where she receives the message “I chose L”): $\begin{pmatrix} c & f \\ 0 & 1 \end{pmatrix}$.

Player 2’s beliefs at the information set on the left assign probability $\frac{\varepsilon(1-\varepsilon)}{\varepsilon(1-\varepsilon) + \varepsilon(1-\varepsilon)} = \frac{1}{2}$ to the top node and probability $\frac{1}{2}$ to the bottom node and her beliefs at the information set on the right assign probability $\frac{\varepsilon^2}{\varepsilon^2 + (1-\varepsilon)^2}$ to the top node and probability $\frac{(1-\varepsilon)^2}{\varepsilon^2 + (1-\varepsilon)^2}$ to the bottom node.
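
The indifference conditions of part (c) can be checked numerically for particular values of the hacker probability. The sketch below is only an illustration: the function name `check_equilibrium` and the dictionary `U` are hypothetical, the payoffs are those of the strategic form in the exercise, and the conditions tested are the ones derived above (p = 1/(2 − 4ε) and q = ε).

```python
from fractions import Fraction as F

# U[(a1, a2)] = (payoff of Player 1, payoff of Player 2), from the strategic form.
U = {("C", "c"): (4, 4), ("C", "f"): (6, 3), ("L", "c"): (3, 1), ("L", "f"): (5, 2)}

def check_equilibrium(eps):
    """Evaluate the indifference and optimality conditions for a given eps."""
    p = 1 / (2 - 4 * eps)   # Player 2 plays c with probability p after "I chose C"
    q = eps                 # Player 1 plays C with probability q

    def pi1(a):
        # Probability that Player 2 reads "I chose C" when Player 1 chose a.
        reads_C = 1 - eps if a == "C" else eps
        after_C = p * U[(a, "c")][0] + (1 - p) * U[(a, "f")][0]
        after_L = U[(a, "f")][0]          # Player 2 plays f after "I chose L"
        return reads_C * after_C + (1 - reads_C) * after_L

    def pi2(action, weight_C, weight_L):
        # Unnormalized expected payoff of Player 2; the weights are the joint
        # probabilities of the two nodes of the information set.
        return weight_C * U[("C", action)][1] + weight_L * U[("L", action)][1]

    left = ((1 - eps) * q, eps * (1 - q))     # nodes after message "I chose C"
    right = (eps * q, (1 - eps) * (1 - q))    # nodes after message "I chose L"
    return {
        "player1_indifferent": pi1("C") == pi1("L"),
        "player2_indifferent_left": pi2("c", *left) == pi2("f", *left),
        "f_strictly_better_right": pi2("f", *right) > pi2("c", *right),
        "belief_top_left": left[0] / sum(left),
    }

result = check_equilibrium(F(1, 10))
print(result)
```

Dividing both expected payoffs at an information set by the same positive probability does not change the comparison, which is why the unnormalized weights suffice for the indifference tests.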

Sequential Equilibrium


11.1  Consistent  assessments 

The  stated  goal  at  the  beginning  of  Chapter  10  was  to  seek  a refinement 
of  the  notion  of  subgame-perfect  equilibrium  that  would  rule  out 
strictly  dominated  choices  at  unreached  information  sets.  The  notion 
of  weak  sequential  equilibrium  achieved  the  goal  of  ruling  out  strictly 
dominated  choices,  but  turned  out  not  to  be  a refinement  of  subgame-perfect 
equilibrium.  As  shown  at  the  end  of  Section  10.3  (Chapter  10),  it  is  possible  for 
the  strategy  profile  in  a weak  sequential  equilibrium  not  to  be  a subgame- 
perfect  equilibrium.  The  reason  for  this  is  that  the  only  restriction  on  beliefs 
that  is  incorporated  in  the  notion  of  weak  sequential  equilibrium  is  Bayesian 
updating  at  reached  information  sets.  At  an  information  set  that  is  not  reached 
by  the  strategy  profile  under  consideration  any  beliefs  whatsoever  are  allowed, 
even  if  those  beliefs  are  at  odds  with  the  strategy  profile. 

To see this, consider the game of Figure 11.1 and the assessment consisting of the pure-strategy profile $\sigma = (c,d,f)$ (highlighted as double edges) and the system of beliefs that assigns probability 1 to node u: $\mu = \begin{pmatrix} s & t & u \\ 0 & 0 & 1 \end{pmatrix}$.

Figure 11.1 (game tree not reproduced)

Since Player 3’s information set {s,t,u} is not reached by $\sigma$, Bayesian updating imposes no restrictions on beliefs at that information set. However, attaching probability 1 to node u is at odds with $\sigma$, because in order for node u to be reached the play must have gone through Player 2’s node and there, according to $\sigma$, Player 2 should have played d with probability 1, making it impossible for node u to be reached.

Thus  we  need  to  impose  some  restrictions  on  beliefs  to  ensure  that  they  are 
consistent  with  the  strategy  profile  with  which  they  are  paired  (in  the 
assessment  under  consideration).  At  reached  information  sets  this  is  achieved  by 
requiring  Bayesian  updating,  but  so  far  we  have  imposed  no  restriction  on 
beliefs  at  unreached  information  sets.  We  want  these  restrictions  to  be  “just  like 
Bayesian  updating”.  Kreps  and  Wilson  (1982)  proposed  a restriction  on  beliefs 
that they called consistency. To motivate it, note that if $\sigma$ is a completely mixed strategy profile (in the sense that $\sigma(a) > 0$, for every choice a) then the issue disappears, because every information set is reached with positive probability and Bayesian updating yields unique beliefs at every information set. For example, in the game of Figure 11.1, if Player 1 uses the completely mixed strategy $\begin{pmatrix} a & b & c \\ p_a & p_b & 1-p_a-p_b \end{pmatrix}$ with $p_a, p_b \in (0,1)$ and $p_a + p_b < 1$ and Player 2 uses the completely mixed strategy $\begin{pmatrix} d & e \\ p_d & 1-p_d \end{pmatrix}$ with $p_d \in (0,1)$ then, by Bayesian updating, Player 3’s beliefs must be

$$\mu(s) = \frac{p_a}{p_a + p_b p_d + p_b(1-p_d)}, \quad \mu(t) = \frac{p_b p_d}{p_a + p_b p_d + p_b(1-p_d)} \quad \text{and} \quad \mu(u) = \frac{p_b(1-p_d)}{p_a + p_b p_d + p_b(1-p_d)}.$$
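
To see why probability 1 on node u cannot arise as a limit of such beliefs, one can evaluate the formulas along a sequence of completely mixed strategies approaching (c, d). The sketch below uses one hypothetical sequence (p_a = p_b = 1/n, p_d = 1 − 1/n); the function names are illustrative only. Since the common denominator equals p_a + p_b, the belief on u is at most 1 − p_d for any such sequence, so it tends to 0 whenever p_d tends to 1.

```python
from fractions import Fraction as F

def beliefs_fig_11_1(p_a, p_b, p_d):
    """Bayesian-updated beliefs (mu(s), mu(t), mu(u)) from the formulas in the text."""
    total = p_a + p_b * p_d + p_b * (1 - p_d)
    return (p_a / total, p_b * p_d / total, p_b * (1 - p_d) / total)

def mu_along_sequence(n):
    # Hypothetical sequence: p_a = p_b = 1/n and p_d = 1 - 1/n, so that
    # Player 1's strategy tends to c and Player 2's strategy tends to d.
    return beliefs_fig_11_1(F(1, n), F(1, n), 1 - F(1, n))

for n in (10, 100, 1000):
    mu_s, mu_t, mu_u = mu_along_sequence(n)
    print(n, mu_s, mu_t, mu_u)
```

Other sequences give different limits for the beliefs on s and t, but none of them can give a positive limit for the belief on u.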


In the case of a completely mixed strategy profile $\sigma$, it is clear what it means for a system of beliefs $\mu$ to be consistent with the strategy profile $\sigma$: $\mu$ must be the unique system of beliefs obtained from $\sigma$ by applying Bayesian updating. What about assessments $(\sigma,\mu)$ where $\sigma$ is such that some information sets are not reached? How can we decide, in such cases, whether $\mu$ is consistent with $\sigma$? Kreps and Wilson (1982) proposed the following criterion: the assessment $(\sigma,\mu)$ is consistent if there is a completely mixed strategy profile $\sigma'$ which is arbitrarily close to $\sigma$ and whose associated unique system of beliefs $\mu'$ (obtained by applying Bayesian updating) is arbitrarily close to $\mu$. In mathematics “arbitrary closeness” is captured by the notion of limit.

Definition 11.1. Given an extensive game, an assessment $(\sigma,\mu)$ is consistent if there is a sequence of completely mixed strategy profiles $\langle \sigma_1, \sigma_2, \ldots, \sigma_n, \ldots \rangle$ such that:

(1) the sequence converges to $\sigma$ as n tends to infinity, that is, $\lim_{n \to \infty} \sigma_n = \sigma$, and

(2) letting $\mu_n$ be the unique system of beliefs obtained from $\sigma_n$ by using Bayesian updating, the sequence $\langle \mu_1, \mu_2, \ldots, \mu_n, \ldots \rangle$ converges to $\mu$ as n tends to infinity, that is, $\lim_{n \to \infty} \mu_n = \mu$.


For example, consider the extensive form of Figure 11.2 below and the following assessment: $\sigma = (c,d,f)$, $\mu =$


f s t' 


1 1 
V«  8 J 

Figure 11.2 (game tree not reproduced)

The notion of consistent assessment $(\sigma,\mu)$ was meant to capture an extension of the requirement of Bayesian updating that would apply also to information sets that have zero probability of being reached (when the play of the game is according to the strategy profile $\sigma$). Definition 11.1 is rather technical and not easy to apply. Showing that an assessment is consistent requires displaying an appropriate sequence and showing that it converges to the given assessment. This is relatively easy as compared to the considerably more difficult task of proving that an assessment is not consistent: this requires showing that every possible sequence that one could construct would not converge. To see the kind of reasoning involved, consider the extensive form of Figure 11.3.


Figure  11.3 

We  want  to  show  that  the  following  assessment  is  not  consistent: 

r a b c d T B e f L R\  (r  s t u W 

U 0 0 0 1 0 0 1 1 oJ^'U  tiff/ 

Suppose  that  (cr,//)  is  consistent.  Then  there  must  be  a sequence  (crB)  of 

completely  mixed  strategies  that  converges  to  cr,  whose  corresponding 
sequence  of  systems  of  beliefs  (//„)  12  (obtained  by  applying  Bayesian 
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updating)  converges  to  p Let  the  nh  term  of  this  sequence  of  completely  mixed 
strategies  be: 


'a 

b 

c d 

T 

B 

e f 

L 

R ' 

= 

a 

b 

c d 

T 

B 

e f 

L 

R 

K Pn 

Pn 

Pn  Pn 

Pn 

Pn 

Pn  Pn 

Pn 

Pn) 

Thus  //;  + pbn  + p‘n  + pdn  = p!n  + //  = pi  + p{  = p'n  +p, f = 1 , pxn  e (0, 1)  for  all 
x<E{a,b,c,d,B,T,e,f,L,R},  lim psn  - 0 for  x e {b,c,d,B,e,R}  and  lim pxn  = 1 

«— >00  72— >00 

for  x e {a,T,f,L} . Let  us  compute  the  corresponding  system  of  beliefs  //;I 


obtained  by  Bayesian  updating: 


b . c b , c 

V Pn  + Pn  Pn+Pn) 


and 


P'nPn 


c B f 

P n Pn  P n 


b B . c B f . d b B , c B f , d b B , c B f . d 

PnPn  + Pn  Pn  Pn  + Pn  PnPn  + PnPn  Pn  + Pn  PnPn  + PnPn  Pn  + Pn 


b B . c 5 / 


b B . c B f 


Note  that 


/LO)  _ Pi 


P,M)  P 


= — . By  hypothesis,  lim  /un  (5)  = //(s)  = y and 


lim  pn(r)  = p(r)  = \ and  thus  lim^-  = lim— — = = — — = — = 1. 

n^cc  pn  PM)  lim  jun(r)  p(r)  \ 


On  the  other  hand,  = Ejl.  p f so  that  lim  = lim 

P,M)  pb/n  ’^pn  (0 

By  hypothesis,  lim  pn  (u)  = p(u)  = f and  lim  //(I  (t)  = //(t)  = j , 

n— >oo  «— >oo 

u (u)  ii //(«<) 

so  that  lim  — ’ - ' 


y c 'n 
Ejl pf 

b in 

VPn  ) 


However, 

contradiction. 


PM)  lim pn(t)  p{t ) 

n— >co 

f c \ f 

f 


■ = 2 . 


lim 

«— >oo 


a; 


\l'n 


J 


lim^- 

«— >00  77  ’ \ 

V Pn  A 


( lim  yE ) = (1)(0)  = 0 , yielding  a 

\ n — >oo  / 


This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this  section,  by  going  through  the  exercises  in  Section  fl.E.l  of  Appendix  ll.E  at 
the  end  of  this  chapter. 

11.2 Sequential equilibrium

The notion of sequential equilibrium was introduced by Kreps and Wilson (1982).

Definition 11.2. Given an extensive game, an assessment $(\sigma,\mu)$ is a sequential equilibrium if it is consistent (Definition 11.1) and sequentially rational (Definition 10.2).


For  an  example  of  a sequential  equilibrium  consider  the  extensive  game  of 
Figure  11.4  below. 


1 R 


1 3 0 3 6 4 

2 6 3 0 4 2 

5 2 1 2 2 2 


Figure  11.4 


Let us show that the following assessment is a sequential equilibrium:

\[
\sigma = \left(\begin{array}{ccc|cc|cc}
L & M & R & A & B & c & d \\
0 & 0 & 1 & \frac{1}{2} & \frac{1}{2} & 1 & 0
\end{array}\right)
\quad\text{and}\quad
\mu = \left(\begin{array}{cc|ccc}
x & y & u & v & w \\
\frac{1}{4} & \frac{3}{4} & \frac{1}{7} & \frac{3}{7} & \frac{3}{7}
\end{array}\right).
\]

Let us first verify sequential rationality of Player 3’s strategy: at her
information set {u,v,w}, given her beliefs, c gives a payoff of
(1/7)(5) + (3/7)(1) + (3/7)(2) = 2 and d gives a payoff of
(1/7)(2) + (3/7)(2) + (3/7)(2) = 2. Thus c is optimal (as would be d and any
randomization over c and d). For Player 2, at his information set {x,y}, given
his beliefs and given the strategy of Player 3, A gives a payoff of
(1/4)(5) + (3/4)(3) = 3.5 and B gives a payoff of (1/4)(2) + (3/4)(4) = 3.5; thus
any mixture of A and B is optimal; in particular, the mixture that assigns
probability 1/2 to A and 1/2 to B is optimal. Finally, at the root, R gives
Player 1 a payoff of 3, L a payoff of (1/2)(4) + (1/2)(1) = 2.5 and M a payoff of
(1/2)(0) + (1/2)(6) = 3; thus R is optimal (as would be any mixture of M and R).

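Next we show consistency. Consider the sequence of completely mixed
strategies {σ_n} where

\[
\sigma_n = \left(\begin{array}{ccc|cc|cc}
L & M & R & A & B & c & d \\
\frac{1}{n} & \frac{3}{n} & 1-\frac{4}{n} & \frac{1}{2} & \frac{1}{2} & 1-\frac{1}{n} & \frac{1}{n}
\end{array}\right)
\]

(all these probabilities are strictly positive for n > 4). Clearly
lim σ_n = σ. The corresponding sequence of systems of beliefs {μ_n} is
given by the following constant sequence, which obviously converges to μ:

\[
\begin{aligned}
\mu_n(x) &= \frac{\frac{1}{n}}{\frac{1}{n}+\frac{3}{n}} = \frac{1}{4}, &
\mu_n(y) &= \frac{\frac{3}{n}}{\frac{1}{n}+\frac{3}{n}} = \frac{3}{4}, \\
\mu_n(u) &= \frac{\frac{1}{n}\cdot\frac{1}{2}}{\frac{1}{n}\cdot\frac{1}{2}+\frac{3}{n}\cdot\frac{1}{2}+\frac{3}{n}\cdot\frac{1}{2}} = \frac{1}{7}, &
\mu_n(v) &= \mu_n(w) = \frac{\frac{3}{n}\cdot\frac{1}{2}}{\frac{1}{n}\cdot\frac{1}{2}+\frac{3}{n}\cdot\frac{1}{2}+\frac{3}{n}\cdot\frac{1}{2}} = \frac{3}{7}.
\end{aligned}
\]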
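The check of the Figure 11.4 assessment can be replayed numerically. The following Python sketch is an illustration, not part of the original text. It hard-codes the payoffs quoted in the argument above and assumes (as those payoff computations suggest) that node x follows L, node y follows M, and that u, v and w are reached through the choice pairs (L, B), (M, A) and (M, B) respectively. The function name `beliefs` is hypothetical.

```python
from fractions import Fraction as F

def beliefs(n):
    """Bayesian beliefs induced by sigma_n (reach structure is an assumption, see text)."""
    pL, pM = F(1, n), F(3, n)      # Player 1's small probabilities on L and M
    pA = pB = F(1, 2)              # Player 2 mixes evenly between A and B
    reach = {"x": pL, "y": pM,
             "u": pL * pB, "v": pM * pA, "w": pM * pB}
    mu = {}
    for info_set in (("x", "y"), ("u", "v", "w")):
        total = sum(reach[h] for h in info_set)
        for h in info_set:
            mu[h] = reach[h] / total   # conditional probability within the information set
    return mu

mu = beliefs(10)
# Player 3's expected payoffs from c and d at {u, v, w}
pay_c = mu["u"] * 5 + mu["v"] * 1 + mu["w"] * 2
pay_d = mu["u"] * 2 + mu["v"] * 2 + mu["w"] * 2
# Player 2's expected payoffs from A and B at {x, y}, given that Player 3 plays c
pay_A = mu["x"] * 5 + mu["y"] * 3
pay_B = mu["x"] * 2 + mu["y"] * 4
print(mu, pay_c, pay_d, pay_A, pay_B)
```

Because the factor 1/n cancels in every ratio, `beliefs(n)` returns the same values for every n > 4, which is why the sequence of beliefs is constant.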

We  now  turn  to  the  properties  of  sequential  equilibria. 


Remark  11.1.  Since  consistency  implies  Bayesian  updating  at  reached 
information  sets,  every  sequential  equilibrium  is  a weak  sequential  equilibrium. 


Theorem 11.1 (Kreps and Wilson, 1982). Given an extensive game, if (σ, μ) is
a sequential equilibrium then σ is a subgame-perfect equilibrium.


Theorem 11.2 (Kreps and Wilson, 1982). Every finite extensive-form game
with cardinal payoffs has at least one sequential equilibrium.


By  Theorem  11.1,  the  notion  of  sequential  equilibrium  achieves  the  objective  of 
refining  the  notion  of  subgame-perfect  equilibrium.  The  relationship  between 
the  various  solution  concepts  considered  so  far  is  shown  in  the  Venn  diagram  of 
Figure  11.5. 
This is a good time to test your understanding of the concepts introduced in
this section, by going through the exercises in Section 11.E.2 of Appendix 11.E at
the end of this chapter.


11.3  Is  ‘consistency’  a good  notion? 

The notion of consistency (Definition 11.1) is unsatisfactory in two respects.

From  a practical  point  of  view,  consistency  is  computationally  hard  to  prove, 
since  one  has  to  construct  a sequence  of  completely  mixed  strategies,  calculate 
the  corresponding  Bayesian  beliefs  and  take  the  limit  of  the  two  sequences.  The 
larger  and  more  complex  the  game,  the  harder  it  is  to  establish  consistency. 

From  a conceptual  point  of  view,  it  is  not  clear  how  one  should  interpret,  or 
justify,  the  requirement  of  taking  the  limit  of  sequences  of  strategies  and  beliefs. 
Indeed,  Kreps  and  Wilson  themselves  express  dissatisfaction  with  their 
definition  of  sequential  equilibrium: 
“We shall proceed here to develop the properties of
sequential  equilibrium  as  defined  above;  however,  we 
do  so  with  some  doubts  of  our  own  concerning  what 
‘ought’  to  be  the  definition  of  a consistent  assessment 
that,  with  sequential  rationality,  will  give  the  ‘proper’ 
definition  of  a sequential  equilibrium.”  (Kreps  and 
Wilson,  1982,  p.  876.) 

In  a similar  vein,  Osborne  and  Rubinstein  (1994,  p.  225)  write 

“We  do  not  find  the  consistency  requirement  to  be 
natural,  since  it  is  stated  in  terms  of  limits;  it  appears  to 
be  a rather  opaque  technical  assumption.” 

In  the  next  chapter  we  will  introduce  a simpler  refinement  of  subgame-perfect 
equilibrium  which  has  a clear  interpretation  in  terms  of  the  AGM  theory  of 
belief  revision:  we  call  it  Perfect  Bayesian  Equilibrium.  We  will  also  show  that 
one  can  use  this  notion  to  provide  a characterization  of  sequential  equilibrium 
that  does  not  require  the  use  of  limits  of  sequences  of  completely  mixed 
strategies. 

We  conclude  this  chapter  by  observing  that  while  the  notion  of  sequential 
equilibrium  eliminates  strictly  dominated  choices  at  information  sets  (even  if 
they  are  reached  with  zero  probability),  it  is  not  strong  enough  to  eliminate 
weakly  dominated  choices.  To  see  this,  consider  the  “simultaneous”  game 
shown  in  Figure  11.6. 

[Game tree not reproduced.]

Figure 11.6
In this game there are two Nash (and subgame-perfect) equilibria: (A, D) and (B,
C). However, C is a weakly dominated strategy for Player 2. The only belief
of Player 2 that rationalizes choosing C is that Player 1 chose B with probability
1 (if Player 2 attaches any positive probability, no matter how small, to Player
1 choosing A, then D is the only sequentially rational choice). Nevertheless,
both Nash equilibria are sequential equilibria. For example, it is
straightforward to check that the “unreasonable” Nash equilibrium σ = (B, C),
when paired with beliefs

\[
\mu = \begin{pmatrix} x & y \\ 0 & 1 \end{pmatrix},
\]

constitutes a sequential equilibrium. Consistency of this assessment is easily
verified by considering the sequence

\[
\sigma_n = \left(\begin{array}{cc|cc}
A & B & C & D \\
\frac{1}{n} & 1-\frac{1}{n} & 1-\frac{1}{n} & \frac{1}{n}
\end{array}\right),
\quad\text{whose associated beliefs are}\quad
\mu_n = \begin{pmatrix} x & y \\ \frac{1}{n} & 1-\frac{1}{n} \end{pmatrix},
\]

and sequential rationality is clearly satisfied.


Many game theorists feel that it is “irrational” to choose a weakly dominated
strategy; thus further refinements beyond sequential equilibrium have been
proposed. A stronger notion of equilibrium, which is a strict refinement of
sequential equilibrium, is the notion of trembling-hand perfect equilibrium. This
notion, due to Reinhard Selten (who also introduced the notion of
subgame-perfect equilibrium), chronologically precedes the notion of sequential
equilibrium (Selten, 1975). Trembling-hand perfect equilibrium does in fact
eliminate weakly dominated strategies. This topic is outside the scope of this
book.⁷


⁷ The interested reader is referred to van Damme (2002).
Appendix 11.E: Exercises

11.E.1. Exercises for Section 11.1: Consistent
assessments

The answers to the following exercises are in Appendix 11.S at the end of this
chapter.

Exercise  11.1.  Consider  the  following  extensive  form. 


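Consider the following (partial) behavior strategy profile

\[
\sigma = \left(\begin{array}{ccc|cc}
a & b & c & d & e \\
\frac{1}{5} & \frac{3}{5} & \frac{1}{5} & \frac{1}{4} & \frac{3}{4}
\end{array}\right).
\]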

Find  the  corresponding  system  of  beliefs  obtained  by  Bayesian  updating. 
Exercise 11.2.

(a) Write the corresponding strategic form.

(b)  Find  all  the  pure-strategy  Nash  equilibria. 

(c)  Find  all  the  pure-strategy  subgame-perfect  equilibria. 

(d)  Which  of  the  pure-strategy  subgame-perfect  equilibria  can  be  part  of  a 
consistent  assessment?  Give  a proof  for  each  of  your  claims. 
11.E.2. Exercises for Section 11.2: Sequential
equilibrium

Exercise  11.3.  Consider  the  following  extensive-form  game.  For  each  pure- 
strategy  Nash  equilibrium  determine  whether  it  is  part  of  an  assessment  which 
is  a sequential  equilibrium. 


[Game tree not reproduced.]
Exercise 11.4. Consider the following game:


(a)  Find  three  subgame-perfect  equilibria.  [Use  pure  strategies  wherever 
possible.] 

(b)  For  each  of  the  equilibria  you  found  in  part  (a),  explain  if  it  can  be  written 
as  part  of  a weak  sequential  equilibrium. 

(c)  Find  a sequential  equilibrium.  [Use  pure  strategies  wherever  possible.] 
Exercise 11.5. An electric circuit connects two switches and a light. One
switch  is  in  Room  1,  the  second  switch  is  in  Room  2 and  the  light  is  in  Room  3. 
Player  1 is  in  Room  1,  Player  2 in  Room  2 and  Player  3 in  Room  3.  The  two 
switches  are  now  in  the  Off  position.  The  light  in  Room  3 comes  on  if  and 
only  if  both  switches  are  in  the  On  position.  Players  1 and  2 act  simultaneously 
and  independently:  each  is  allowed  only  two  choices,  namely  whether  to  leave 
her  switch  in  the  Off  position  or  turn  it  to  the  On  position;  these  decisions 
have  to  be  executed  within  one  minute.  If  the  light  comes  on  in  Room  3 then 
the  game  ends  and  Players  1 and  2 get  $100  each  while  Player  3 gets  $300.  If  the 
light  in  Room  3 stays  off  after  one  minute,  then  Player  3 (not  knowing  what 
the  other  players  did)  has  to  make  a guess  as  to  what  Players  1 and  2 did  (thus, 
for  example,  one  possible  guess  is  “both  players  left  their  respective  switches  in 
the  Off  position”).  The  payoffs  are  as  follows:  (i)  if  Player  3’s  guess  turns  out  to 
be  correct  then  each  player  gets  $200,  (ii)  if  Player  3 makes  one  correct  guess 
but  the  other  wrong  (e.g.  he  guesses  that  both  Player  1 and  Player  2 chose 
“Off”  and,  as  a matter  of  fact,  Player  1 chose  “Off”  while  Player  2 chose  “On”), 
then  Player  3 gets  $50,  the  player  whose  action  was  guessed  correctly  gets  $100 
and  the  remaining  player  gets  nothing  (in  the  previous  example,  Player  1 gets 
$100,  Player  2 gets  nothing  and  Player  3 gets  $50)  and  (iii)  if  Player  3’s  guess  is 
entirely  wrong  then  all  the  players  get  nothing.  All  the  players  are  selfish  and 
greedy (that is, each player only cares about how much money he/she gets and
prefers  more  money  to  less)  and  risk  neutral. 

(a)  Represent  this  situation  as  an  extensive-form  game. 

(b)  Write  the  corresponding  strategic  form,  assigning  the  rows  to  Player  1,  the 
columns  to  Player  2,  etc. 

(c)  Find  all  the  pure-strategy  Nash  equilibria. 

(d)  For  at  least  one  pure-strategy  Nash  equilibrium  prove  that  it  cannot  be 
part  of  a sequential  equilibrium. 

Exercise  11.6.  (Challenging  Question). 

A buyer  and  a seller  are  bargaining  over  an  object  owned  by  the  seller.  The 
value  of  the  object  to  the  buyer  is  known  to  her  but  not  to  the  seller.  The 
buyer is drawn randomly from a population with the following characteristics:
a fraction λ values the object at $H while the remaining fraction 1 - λ values the
object at $L, with H > L > 0. The bargaining takes place over two periods. In
the  first  period  the  seller  makes  a take-it-or-leave-it  offer  (i.e.  names  the  price) 
and  the  buyer  accepts  or  rejects.  If  the  buyer  accepts,  the  transaction  takes  place 
and  the  game  ends.  If  the  buyer  rejects,  then  the  seller  makes  a new  take-it-or- 
leave-it  offer  and  the  buyer  accepts  or  rejects.  In  either  case  the  game  ends. 
Payoffs  are  as  follows:  (1)  if  the  seller’s  offer  is  accepted  (whether  it  was  made  in 
the  first  period  or  in  the  second  period),  the  seller’s  payoff  is  equal  to  the  price 
agreed  upon  and  the  buyer’s  payoff  is  equal  to  the  difference  between  the  value 
of  the  object  to  the  buyer  and  the  price  paid;  (2)  if  the  second  offer  is  rejected 
both players get a payoff of 0. Assume that both players discount period 2
payoffs with a discount factor δ ∈ (0,1), that is, from the point of view of
period 1, getting $x in period 2 is considered to be the same as getting $δx in
period 1. For example, if the seller offers price p in the first period and the offer
is accepted, then the seller’s payoff is p, whereas if the same price p is offered
and accepted in the second period, then the seller’s payoff, viewed from the
standpoint of period 1, is δp. Assume that these payoffs are von Neumann-
Morgenstern payoffs.

(a) Draw the extensive form of this game for the case where, in both periods,
the seller can only offer one of two prices: $10 or $12; assume further that
H = 20, L = 10, δ = 3/4 and λ = 2/3. Nature moves first and selects the value
for the buyer; the buyer is informed, while the seller is not. It is common
knowledge between buyer and seller that Nature will pick H with
probability λ and L with probability (1 - λ).

(b)  For  the  game  of  part  (a)  find  a pure-strategy  sequential  equilibrium.  Prove 
that  what  you  suggest  is  indeed  a sequential  equilibrium. 
Appendix 11.S: Solutions to exercises


Exercise 11.1. The extensive form under consideration is


The system of beliefs obtained, by Bayesian updating, from the (partial)
behavior strategy profile

\[
\sigma = \left(\begin{array}{ccc|cc}
a & b & c & d & e \\
\frac{1}{5} & \frac{3}{5} & \frac{1}{5} & \frac{1}{4} & \frac{3}{4}
\end{array}\right)
\]

is as follows:
Exercise 11.2. (a) The corresponding strategic form is as follows:

                            Player 2
                  tl        tr        bl        br
            L    0, 10     0, 10     0, 10     0, 10
  Player 1  R    1, 1      1, 1     -1, -1    -1, -1
            W   -1, 0      0, 1     -1, 0      0, 1
            E   -1, 1      1, 2     -1, 1      1, 2

(b) The pure-strategy Nash equilibria are: (L, bl), (R, tl), (R, tr), (E, tr), (E, br).
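The list of pure-strategy Nash equilibria in part (b) can be double-checked by brute force. The sketch below is an illustration only: it re-enters the 4x4 payoff table from part (a) by hand and tests, cell by cell, whether either player has a profitable unilateral deviation. The helper names `u` and `pure_nash` are hypothetical.

```python
from itertools import product

rows = ["L", "R", "W", "E"]          # Player 1's pure strategies
cols = ["tl", "tr", "bl", "br"]      # Player 2's pure strategies
payoffs = {                          # (Player 1's payoff, Player 2's payoff)
    "L": [(0, 10), (0, 10), (0, 10), (0, 10)],
    "R": [(1, 1), (1, 1), (-1, -1), (-1, -1)],
    "W": [(-1, 0), (0, 1), (-1, 0), (0, 1)],
    "E": [(-1, 1), (1, 2), (-1, 1), (1, 2)],
}

def u(r, c):
    """Payoff pair at row r, column c."""
    return payoffs[r][cols.index(c)]

def pure_nash():
    """All cells where neither player gains from a unilateral deviation."""
    eq = []
    for r, c in product(rows, cols):
        row_ok = all(u(r, c)[0] >= u(r2, c)[0] for r2 in rows)
        col_ok = all(u(r, c)[1] >= u(r, c2)[1] for c2 in cols)
        if row_ok and col_ok:
            eq.append((r, c))
    return eq

equilibria = pure_nash()
print(equilibria)
```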

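(c) There is only one proper subgame, namely the one that starts at Player 2’s
decision node that follows R; in that subgame the unique Nash equilibrium is t.
Thus only the following are subgame-perfect equilibria: (R, tl), (R, tr), (E, tr).


(d) Each of the above subgame-perfect equilibria can be part of a consistent
assessment.

Let σ = (R, tl) and

\[
\mu = \begin{pmatrix} x_2 & x_3 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix};
\]

this assessment is consistent as the following sequences of completely mixed
strategies and corresponding systems of beliefs show (for n > 3, so that all
probabilities are positive):

\[
\sigma_n = \left(\begin{array}{cccc|cc|cc}
L & W & E & R & b & t & l & r \\
\frac{1}{n} & \frac{1}{n} & \frac{1}{n} & 1-\frac{3}{n} & \frac{1}{n} & 1-\frac{1}{n} & 1-\frac{1}{n} & \frac{1}{n}
\end{array}\right),
\qquad
\mu_n = \begin{pmatrix} x_2 & x_3 \\[2pt]
\frac{1/n}{1/n+1/n} = \frac{1}{2} & \frac{1/n}{1/n+1/n} = \frac{1}{2} \end{pmatrix}.
\]

Clearly lim σ_n = σ and lim μ_n = μ.

Let σ = (R, tr) and

\[
\mu = \begin{pmatrix} x_2 & x_3 \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix};
\]

this assessment is consistent as the following sequences of completely mixed
strategies and corresponding systems of beliefs show:

\[
\sigma_n = \left(\begin{array}{cccc|cc|cc}
L & W & E & R & b & t & l & r \\
\frac{1}{n} & \frac{1}{n} & \frac{1}{n} & 1-\frac{3}{n} & \frac{1}{n} & 1-\frac{1}{n} & \frac{1}{n} & 1-\frac{1}{n}
\end{array}\right),
\qquad
\mu_n = \begin{pmatrix} x_2 & x_3 \\[2pt]
\frac{1/n}{1/n+1/n} = \frac{1}{2} & \frac{1/n}{1/n+1/n} = \frac{1}{2} \end{pmatrix}.
\]

Clearly lim σ_n = σ and lim μ_n = μ.

Let σ = (E, tr) and

\[
\mu = \begin{pmatrix} x_2 & x_3 \\ 0 & 1 \end{pmatrix};
\]

this assessment is consistent as the following sequences of completely mixed
strategies and corresponding beliefs show:

\[
\sigma_n = \left(\begin{array}{cccc|cc|cc}
L & W & E & R & b & t & l & r \\
\frac{1}{n} & \frac{1}{n} & 1-\frac{3}{n} & \frac{1}{n} & \frac{1}{n} & 1-\frac{1}{n} & \frac{1}{n} & 1-\frac{1}{n}
\end{array}\right),
\qquad
\mu_n = \begin{pmatrix} x_2 & x_3 \\[2pt]
\frac{1/n}{1/n+1-3/n} & \frac{1-3/n}{1/n+1-3/n} \end{pmatrix}.
\]

Clearly lim σ_n = σ and lim μ_n = μ.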
Exercise 11.3. The game under consideration is the following:

[Game tree not reproduced.]

(a) The corresponding strategic form is:

  Player 3 chooses a:
                       Player 2
                     l          r
            R     1, 0, 0    1, 0, 0
  Player 1  M     0, 1, 1    2, 2, 1
            L     0, 1, 0    0, 0, 0

  Player 3 chooses b:
                       Player 2
                     l          r
            R     1, 0, 0    1, 0, 0
  Player 1  M     0, 1, 0    0, 0, 0
            L     0, 1, 0    0, 0, 0

(b) The pure-strategy Nash equilibria are: (R,l,a), (M,r,a), (R,l,b) and (R,r,b).
They are all subgame perfect because there are no proper subgames. (R,l,b)
and (R,r,b) cannot be part of a sequential equilibrium because b is a strictly
dominated choice at Player 3’s information set and, therefore, (R,l,b) and
(R,r,b) would violate sequential rationality (with any system of beliefs). On
the other hand, both (R,l,a) and (M,r,a) can be part of an assessment which
is a sequential equilibrium.
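For a three-player strategic form such as the one in part (a), the Nash check is the same unilateral-deviation test applied to each player in turn. The sketch below is illustrative: it copies the two payoff tables from part (a) into a dictionary, and `is_nash` is a hypothetical helper name.

```python
from itertools import product

# Pure strategies of Players 1, 2 and 3
strategies = [["L", "M", "R"], ["l", "r"], ["a", "b"]]

# Payoff triples (Player 1, Player 2, Player 3), copied from the strategic form
payoff = {
    ("R", "l", "a"): (1, 0, 0), ("R", "r", "a"): (1, 0, 0),
    ("M", "l", "a"): (0, 1, 1), ("M", "r", "a"): (2, 2, 1),
    ("L", "l", "a"): (0, 1, 0), ("L", "r", "a"): (0, 0, 0),
    ("R", "l", "b"): (1, 0, 0), ("R", "r", "b"): (1, 0, 0),
    ("M", "l", "b"): (0, 1, 0), ("M", "r", "b"): (0, 0, 0),
    ("L", "l", "b"): (0, 1, 0), ("L", "r", "b"): (0, 0, 0),
}

def is_nash(profile):
    """True if no player can strictly gain by changing only his own strategy."""
    for i, options in enumerate(strategies):
        for s in options:
            deviation = profile[:i] + (s,) + profile[i + 1:]
            if payoff[deviation][i] > payoff[profile][i]:
                return False
    return True

nash = [p for p in product(*strategies) if is_nash(p)]
print(nash)
```

The check confirms the four equilibria but says nothing about sequential rationality at unreached information sets, which is why (R,l,b) and (R,r,b) survive it.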
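First we show that σ = (R,l,a) together with the system of beliefs

\[
\mu = \left(\begin{array}{cc|cc}
x_1 & x_2 & x_3 & x_4 \\
\frac{2}{3} & \frac{1}{3} & 1 & 0
\end{array}\right)
\]

is a sequential equilibrium. Consider the sequence of completely mixed
strategy profiles whose nth term is

\[
\sigma_n = \left(\begin{array}{ccc|cc|cc}
L & M & R & l & r & a & b \\
\frac{2}{n} & \frac{1}{n} & 1-\frac{3}{n} & 1-\frac{1}{n} & \frac{1}{n} & 1-\frac{1}{n} & \frac{1}{n}
\end{array}\right).
\]

Clearly lim σ_n = σ. The corresponding Bayesian system of beliefs has nth term

\[
\begin{aligned}
\mu_n(x_1) &= \frac{\frac{2}{n}}{\frac{2}{n}+\frac{1}{n}} = \frac{2}{3}, &
\mu_n(x_2) &= \frac{\frac{1}{n}}{\frac{2}{n}+\frac{1}{n}} = \frac{1}{3}, \\
\mu_n(x_3) &= \frac{\frac{1}{n}\left(1-\frac{1}{n}\right)}{\frac{1}{n}\left(1-\frac{1}{n}\right)+\frac{1}{n}\left(\frac{1}{n}\right)} = 1-\frac{1}{n}, &
\mu_n(x_4) &= \frac{\frac{1}{n}\left(\frac{1}{n}\right)}{\frac{1}{n}\left(1-\frac{1}{n}\right)+\frac{1}{n}\left(\frac{1}{n}\right)} = \frac{1}{n}.
\end{aligned}
\]

Clearly lim μ_n = μ. Thus the assessment is consistent (Definition 11.1).

Sequential rationality is easily checked: given the strategy profile and the
system of beliefs,

(1) for Player 3, a yields 1, while b yields 0, (2) for Player 2, l yields 1, while r
yields (2/3)(0) + (1/3)(2) = 2/3 and (3) for Player 1, R yields 1, while L and M yield 0.

Next we show that σ = (M,r,a) together with

\[
\mu = \left(\begin{array}{cc|cc}
x_1 & x_2 & x_3 & x_4 \\
0 & 1 & 0 & 1
\end{array}\right)
\]

is a sequential equilibrium. Consider the sequence of completely mixed strategy
profiles whose nth term is

\[
\sigma_n = \left(\begin{array}{ccc|cc|cc}
L & M & R & l & r & a & b \\
\frac{1}{2n} & 1-\frac{1}{n} & \frac{1}{2n} & \frac{1}{n} & 1-\frac{1}{n} & 1-\frac{1}{n} & \frac{1}{n}
\end{array}\right).
\]

Clearly lim σ_n = σ. The corresponding Bayesian system of beliefs has nth term

\[
\begin{aligned}
\mu_n(x_1) &= \frac{\frac{1}{2n}}{\frac{1}{2n}+1-\frac{1}{n}} = \frac{1}{2n-1}, &
\mu_n(x_2) &= \frac{1-\frac{1}{n}}{\frac{1}{2n}+1-\frac{1}{n}} = \frac{2n-2}{2n-1}, \\
\mu_n(x_3) &= \frac{\left(1-\frac{1}{n}\right)\frac{1}{n}}{\left(1-\frac{1}{n}\right)\frac{1}{n}+\left(1-\frac{1}{n}\right)\left(1-\frac{1}{n}\right)} = \frac{1}{n}, &
\mu_n(x_4) &= 1-\frac{1}{n}.
\end{aligned}
\]

Clearly lim μ_n = μ. Thus the assessment is consistent (Definition 11.1).

Sequential rationality is easily checked: given the strategy profile and the
system of beliefs,

(1) for Player 3, a yields 1, while b yields 0, (2) for Player 2, l yields 1, while r
yields 2 and (3) for Player 1, M yields 2, while R yields 1 and L yields 0.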
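The convergence of beliefs for σ = (M,r,a) can be checked with exact fractions. The sketch below is illustrative and rests on an assumption about the tree that is inferred from the payoff computations rather than stated in the text: x1 follows L, x2 follows M, x3 follows (M, l) and x4 follows (M, r). The function name `beliefs` is hypothetical, and n must be at least 2 so that every probability is positive.

```python
from fractions import Fraction as F

def beliefs(n):
    """Bayesian beliefs (mu(x1), mu(x2), mu(x3), mu(x4)) induced by sigma_n."""
    pL, pM = F(1, 2 * n), 1 - F(1, n)   # Player 1: probabilities of L and M
    pl, pr = F(1, n), 1 - F(1, n)       # Player 2: probabilities of l and r
    mu_x1 = pL / (pL + pM)
    mu_x2 = pM / (pL + pM)
    mu_x3 = pM * pl / (pM * pl + pM * pr)
    mu_x4 = pM * pr / (pM * pl + pM * pr)
    return mu_x1, mu_x2, mu_x3, mu_x4

for n in (5, 50, 500):
    print(n, [float(b) for b in beliefs(n)])
```

As n grows the printed vectors approach (0, 1, 0, 1), the system of beliefs paired with (M,r,a).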
Exercise 11.4. The game under consideration is the following:

[Game tree not reproduced.]

(a) First we solve the subgame on the left, whose strategic form is:

                     Player 2
                   C        D
  Player 1  A    1, 0     0, 1
            B    0, 2     2, 0

There is no pure-strategy Nash equilibrium. Let p be the probability of A and q
the probability of C. Then at a Nash equilibrium it must be that q = 2(1 - q) and
2(1 - p) = p. Thus there is a unique Nash equilibrium given by

\[
\left(\begin{array}{cc|cc}
A & B & C & D \\
\frac{2}{3} & \frac{1}{3} & \frac{2}{3} & \frac{1}{3}
\end{array}\right)
\]

with an expected payoff of 2/3 for both players.
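The indifference conditions q = 2(1 - q) and 2(1 - p) = p can be solved mechanically for any 2x2 game that has a fully mixed equilibrium. The sketch below is an illustration: `mixed_2x2` is a hypothetical helper, the payoff matrices are copied from the left subgame, and the formula assumes the denominators are nonzero (which holds here).

```python
from fractions import Fraction as F

# Rows: Player 1 plays A or B; columns: Player 2 plays C or D
U1 = [[1, 0], [0, 2]]   # Player 1's payoffs
U2 = [[0, 1], [2, 0]]   # Player 2's payoffs

def mixed_2x2(U1, U2):
    """Return (p, q): p = Pr(first row), q = Pr(first column) in the fully mixed equilibrium."""
    # q makes Player 1 indifferent between the two rows
    q = F(U1[1][1] - U1[0][1], U1[0][0] - U1[0][1] - U1[1][0] + U1[1][1])
    # p makes Player 2 indifferent between the two columns
    p = F(U2[1][1] - U2[1][0], U2[0][0] - U2[0][1] - U2[1][0] + U2[1][1])
    return p, q

def expected(U, p, q):
    """Expected payoff under the mixed profile (p, q)."""
    return (p * q * U[0][0] + p * (1 - q) * U[0][1]
            + (1 - p) * q * U[1][0] + (1 - p) * (1 - q) * U[1][1])

p, q = mixed_2x2(U1, U2)
print(p, q, expected(U1, p, q), expected(U2, p, q))
```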

Next consider the subgame on the right. In this subgame the following are
pure-strategy Nash equilibria: (F,H,M) (where Player 1’s payoff is 1), (E,H,L)
(where Player 1’s payoff is 2), and (E,G,L) (where Player 1’s payoff is 2).

Thus the following are subgame-perfect equilibria:

\[
\left(\begin{array}{cc|cc|cc|cc|cc|cc}
L & R & A & B & C & D & E & F & G & H & L & M \\
0 & 1 & \frac{2}{3} & \frac{1}{3} & \frac{2}{3} & \frac{1}{3} & 0 & 1 & 0 & 1 & 0 & 1
\end{array}\right)
\]



GAME  THEORY  - Giacomo  Bonanno 


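\[
\left(\begin{array}{cc|cc|cc|cc|cc|cc}
L & R & A & B & C & D & E & F & G & H & L & M \\
0 & 1 & \frac{2}{3} & \frac{1}{3} & \frac{2}{3} & \frac{1}{3} & 1 & 0 & 0 & 1 & 1 & 0
\end{array}\right)
\]

\[
\left(\begin{array}{cc|cc|cc|cc|cc|cc}
L & R & A & B & C & D & E & F & G & H & L & M \\
0 & 1 & \frac{2}{3} & \frac{1}{3} & \frac{2}{3} & \frac{1}{3} & 1 & 0 & 1 & 0 & 1 & 0
\end{array}\right)
\]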

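(b) The strategy profile

\[
\left(\begin{array}{cc|cc|cc|cc|cc|cc}
L & R & A & B & C & D & E & F & G & H & L & M \\
0 & 1 & \frac{2}{3} & \frac{1}{3} & \frac{2}{3} & \frac{1}{3} & 0 & 1 & 0 & 1 & 0 & 1
\end{array}\right)
\]

cannot be part of a weak sequential equilibrium, because choice M is strictly
dominated by L and thus there are no beliefs at Player 3’s information set that
justify choosing M.

The strategy profile

\[
\left(\begin{array}{cc|cc|cc|cc|cc|cc}
L & R & A & B & C & D & E & F & G & H & L & M \\
0 & 1 & \frac{2}{3} & \frac{1}{3} & \frac{2}{3} & \frac{1}{3} & 1 & 0 & 0 & 1 & 1 & 0
\end{array}\right)
\]

cannot be part of a weak sequential equilibrium, because, given that Player 3
chooses L, H is not a sequentially rational choice for Player 2 at his singleton
node.

The strategy profile

\[
\left(\begin{array}{cc|cc|cc|cc|cc|cc}
L & R & A & B & C & D & E & F & G & H & L & M \\
0 & 1 & \frac{2}{3} & \frac{1}{3} & \frac{2}{3} & \frac{1}{3} & 1 & 0 & 1 & 0 & 1 & 0
\end{array}\right)
\]

is a weak sequential equilibrium with the following system of beliefs:

\[
\left(\begin{array}{cc|cc}
x & y & w & z \\
\frac{2}{3} & \frac{1}{3} & 1 & 0
\end{array}\right).
\]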

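(c) The strategy profile

\[
\sigma = \left(\begin{array}{cc|cc|cc|cc|cc|cc}
L & R & A & B & C & D & E & F & G & H & L & M \\
0 & 1 & \frac{2}{3} & \frac{1}{3} & \frac{2}{3} & \frac{1}{3} & 1 & 0 & 1 & 0 & 1 & 0
\end{array}\right)
\]

together with beliefs

\[
\mu = \left(\begin{array}{cc|cc}
x & y & w & z \\
\frac{2}{3} & \frac{1}{3} & 1 & 0
\end{array}\right)
\]

is a sequential equilibrium. Consistency can be verified with the sequence of
completely mixed strategies

\[
\sigma_n = \left(\begin{array}{cc|cc|cc|cc|cc|cc}
L & R & A & B & C & D & E & F & G & H & L & M \\
\frac{1}{n} & 1-\frac{1}{n} & \frac{2}{3} & \frac{1}{3} & \frac{2}{3} & \frac{1}{3} & 1-\frac{1}{n} & \frac{1}{n} & 1-\frac{1}{n} & \frac{1}{n} & 1-\frac{1}{n} & \frac{1}{n}
\end{array}\right).
\]

For example,

\[
\mu_n(w) = \frac{\left(1-\frac{1}{n}\right)\left(1-\frac{1}{n}\right)}{\left(1-\frac{1}{n}\right)\left(1-\frac{1}{n}\right)+\left(1-\frac{1}{n}\right)\frac{1}{n}\left(1-\frac{1}{n}\right)} = \frac{1}{1+\frac{1}{n}} \to 1 \quad\text{as } n \to \infty .
\]

Sequential rationality is easily verified.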
Exercise 11.5. (a) The extensive form is as follows, where FF means 1Off-2Off,
FN means 1Off-2On, etc.

[Game tree not reproduced.]

(b) The corresponding strategic form is as follows:

  Player 3: 1On-2Off
                            Player 2
                        On               Off
  Player 1  On     100, 100, 300    200, 200, 200
            Off      0, 0, 0          0, 100, 50

  Player 3: 1Off-2On
                            Player 2
                        On               Off
  Player 1  On     100, 100, 300      0, 0, 0
            Off    200, 200, 200    100, 0, 50

  Player 3: both Off
                            Player 2
                        On               Off
  Player 1  On     100, 100, 300      0, 100, 50
            Off    100, 0, 50       200, 200, 200
(c) The pure-strategy Nash equilibria are:
(On, Off, 1On-2Off), (Off, On, 1Off-2On), (On, On, both-Off) and
(Off, Off, both-Off).


(d) (On, On, both-Off) cannot be part of a sequential equilibrium. First of
all, for Player 3 ‘both-Off’ is a sequentially rational choice only if Player
3 attaches (sufficiently high) positive probability to node x. However,
consistency does not allow beliefs with μ(x) > 0. To see this, consider a
sequence of completely mixed strategies {p_n, q_n} for Players 1 and 2,
where p_n is the probability with which Player 1 chooses Off and q_n is
the probability with which Player 2 chooses Off, and lim p_n = lim q_n = 0.
Then, by Bayesian updating,

\[
\mu_n(x \mid \{x,y,z\}) = \frac{p_n q_n}{p_n q_n + p_n(1-q_n) + q_n(1-p_n)}. \qquad (\blacklozenge)
\]

If q_n goes to 0 as fast as, or faster than, p_n (that is, if lim q_n/p_n is finite),
then divide numerator and denominator of (♦) by p_n to get

\[
\mu_n(x \mid \{x,y,z\}) = \frac{q_n}{q_n + (1-q_n) + \frac{q_n}{p_n}(1-p_n)}.
\]

Taking the limit as n → ∞ we get

\[
\frac{0}{0 + 1 + \lim_{n\to\infty}\frac{q_n}{p_n}} = 0 .
\]

[If p_n goes to 0 as fast as or faster than q_n, then repeat the above argument
by dividing by q_n.] Thus a consistent assessment must assign zero probability
to node x.
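The limit argument in part (d) can be illustrated numerically. The sketch below assumes, as the Bayesian formula requires, that x is the node where both switches are Off and that p_n and q_n are the (vanishing) probabilities with which Players 1 and 2 choose Off. The function name `belief_x` and the three trial sequences are hypothetical choices made only for illustration.

```python
from fractions import Fraction as F

def belief_x(p, q):
    """Pr(x | {x, y, z}) when Player 1 plays Off w.p. p and Player 2 plays Off w.p. q."""
    return p * q / (p * q + p * (1 - q) + q * (1 - p))

# Three ways of letting both trembles vanish, at equal or different rates
sequences = {
    "p_n = q_n = 1/n":        lambda n: (F(1, n), F(1, n)),
    "p_n = 1/n, q_n = 1/n^2": lambda n: (F(1, n), F(1, n ** 2)),
    "p_n = 1/n^3, q_n = 1/n": lambda n: (F(1, n ** 3), F(1, n)),
}

for label, seq in sequences.items():
    print(label, [float(belief_x(*seq(n))) for n in (10, 100, 1000)])
```

Whatever the relative rates, the belief on x shrinks toward 0, in line with the conclusion that no consistent assessment can assign positive probability to x.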
Exercise 11.6. (b) The following assessment (σ, μ) is a sequential equilibrium.
The pure-strategy profile σ is as follows:

The seller offers p = 12 in period 1 and, if his offer is rejected, he offers q =
10 in period 2; furthermore, if the first-period offer had been q = 10 and it
had been rejected then he would have offered q = 10 again in the second
period.

The H buyer (that is, the buyer at information sets that follow Nature’s
choice of H) always says yes to any offer of the seller.

The L buyer (that is, the buyer at information sets that follow Nature’s
choice of L) always says no to an offer of p and always says yes to an offer
of q.

The system of beliefs is as follows (where TL means the left node of the top
information set of the seller, TR the right node of that information set, ML
means the left node of the middle information set of the seller, MR the right
node of that information set, BL means the left node of the bottom
information set of the seller, BR the right node of that information set):

\[
\mu = \left(\begin{array}{cc|cc|cc}
TL & TR & ML & MR & BL & BR \\
0 & 1 & \frac{2}{3} & \frac{1}{3} & 0 & 1
\end{array}\right).
\]

Let us first check sequential rationality. The seller’s payoff is
(2/3)(12) + (1/3)(3/4)(10) = 21/2, which is greater than the payoff he would get
if he offered q = 10, namely a payoff of 10. The H-type’s payoff is 20 - 12 = 8,
while if she said no to p = 12 and then yes to q = 10 in period 2 her payoff
would be (3/4)(20 - 10) = 30/4 = 7.5. Furthermore, for the H type, at every node
of hers, saying yes is always strictly better than saying no. The L-type’s
payoff is 0, while if she said yes to p = 12 then her payoff would be -2. At
every node of the L type after having been offered p = 12 saying no is
strictly better than saying yes and at every node after having been offered q
= 10 saying no gives the same payoff as saying yes, namely 0.
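The payoff comparisons used in the sequential-rationality check can be reproduced with exact arithmetic. The sketch below is an illustration only: it uses the parameter values H = 20, L = 10, discount factor 3/4 and fraction 2/3 of H-buyers that the computations above rely on, and all variable names are hypothetical.

```python
from fractions import Fraction as F

H, L = 20, 10                    # buyer valuations
delta, lam = F(3, 4), F(2, 3)    # discount factor and fraction of H-buyers
p, q = 12, 10                    # the two admissible prices

# Seller: offer p first (H accepts now, L rejects and accepts q next period)
seller_offer_p = lam * p + (1 - lam) * delta * q
# Seller: offer q first (both types accept immediately)
seller_offer_q = q

# H-buyer: accept p now, or reject and accept q in period 2
H_accept_p = H - p
H_wait_for_q = delta * (H - q)

# L-buyer: accepting p would give a negative payoff; accepting q gives 0
L_accept_p = L - p
L_accept_q = L - q

print(seller_offer_p, seller_offer_q, H_accept_p, H_wait_for_q, L_accept_p, L_accept_q)
```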

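To check consistency, construct the following completely mixed strategy
profile σ_n (n = 2, 3, ...): (1) for the seller and for the L-buyer, any choice
that has zero probability in σ is assigned probability 1/n in σ_n and any choice
that has probability 1 in σ is assigned probability 1 - 1/n in σ_n, (2) for the
H-buyer, any choice that has zero probability in σ is assigned probability 1/n²
in σ_n and any choice that has probability 1 in σ is assigned probability
1 - 1/n² in σ_n. Let us compute the corresponding beliefs μ_n at the top and at
the bottom information sets of the seller, that is, at his two second-period
information sets. At the information set that follows a rejected offer of
q = 10, the left node (where the buyer is of type H) has probability

\[
\frac{\frac{2}{3}\cdot\frac{1}{n}\cdot\frac{1}{n^2}}{\frac{2}{3}\cdot\frac{1}{n}\cdot\frac{1}{n^2}+\frac{1}{3}\cdot\frac{1}{n}\cdot\frac{1}{n}} = \frac{2}{n+2},
\]

and at the information set that follows a rejected offer of p = 12 the left
node has probability

\[
\frac{\frac{2}{3}\left(1-\frac{1}{n}\right)\frac{1}{n^2}}{\frac{2}{3}\left(1-\frac{1}{n}\right)\frac{1}{n^2}+\frac{1}{3}\left(1-\frac{1}{n}\right)\left(1-\frac{1}{n}\right)} = \frac{2}{n^2-n+2}.
\]

Both expressions tend to 0 as n → ∞; thus lim μ_n(TL) = 0 = μ(TL) and
lim μ_n(TR) = 1 = μ(TR), and lim μ_n(BL) = 0 = μ(BL) and
lim μ_n(BR) = 1 = μ(BR).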
Perfect Bayesian Equilibrium


12.1  Belief revision and AGM consistency

Any attempt to refine the notion of subgame-perfect equilibrium in
extensive-form (or dynamic) games must deal with the issue of belief
revision: how should a player revise her beliefs when informed that
she has to make a choice at an information set to which she initially assigned
zero probability? As we saw in Chapter 11, Kreps and Wilson (1982) suggested
the notion of a consistent assessment (Definition 11.1) to deal with this issue.
From now on, we shall refer to the notion of consistency proposed by Kreps
and Wilson as KW-consistency (KW stands for ‘Kreps-Wilson’), in order to
distinguish it from a different notion of consistency, called AGM-consistency,
that will be introduced in this section. We shall make use of concepts
introduced in Section 8.3 (Chapter 8): the reader might want to review that
material before continuing.

In this chapter it will be more convenient to use the so-called “history-based”
definition of extensive-form game, which is spelled out in Appendix 12.A.
Essentially it boils down to identifying a node in the tree with the sequence of
actions leading from the root to it. We call a sequence of actions starting from
the root of the tree a history. For example, consider the extensive form of
Figure 12.1.
Node x0 (the root of the tree) is identified with the null or empty history ∅,
decision node x6 with the history (or sequence of actions) adf, terminal node z8
with history adfn, etc. Thus it will no longer be necessary to label the nodes of
the tree, because we can refer to them by naming the corresponding histories.

If h is a decision history we denote by A(h) the set of actions (or choices)
available at h. For example, in the game of Figure 12.1, A(a) = {c,d},
A(ac) = {e,f}, A(adf) = {m,n}, etc. If h is a history and a is an action available
at h (that is, a ∈ A(h)), then we denote by ha the history obtained by appending
a to h.

We saw in Chapter 8 (Section 8.3) that the AGM theory of belief revision
(introduced by Alchourrón et al., 1985) is intimately linked to the notion of a
plausibility order. This is what motivates the following definition.⁸


⁸ For an explicit analysis of the connection between the AGM theory of belief revision and the
notion of plausibility order introduced in Definition 12.1 see Bonanno (2011).
First recall that a total pre-order on a set H is a binary relation ≾ which is
complete (for all h, h′ ∈ H, either h ≾ h′ or h′ ≾ h or both) and transitive (for
all h, h′, h″ ∈ H, if h ≾ h′ and h′ ≾ h″ then h ≾ h″). We write h ~ h′ as a
short-hand for “h ≾ h′ and h′ ≾ h” and h ≺ h′ as a short-hand for “h ≾ h′ and
not h′ ≾ h”.


Definition 12.1. Given a finite extensive form, a plausibility order is a total
pre-order ≾ on the finite set of histories H that satisfies the following properties
(recall that D is the set of decision histories, A(h) is the set of actions available
at decision history h and I(h) is the information set that contains decision
history h): for all h ∈ D,

PL1. For all a ∈ A(h), h ≾ ha.

PL2. (i) There exists an a ∈ A(h) such that h ~ ha, and

(ii) for all a ∈ A(h), if h ~ ha then h′ ~ h′a for all h′ ∈ I(h).

PL3. If history h is assigned to chance, then h ~ ha, for all a ∈ A(h).


The interpretation of h ≾ h′ is that history h is at least as plausible as history h′;
thus h ≺ h′ means that h is more plausible than h′ and h ~ h′ means that h is just
as plausible as h′. An alternative reading of h ≾ h′ is “h weakly precedes h′ in
terms of plausibility”.

Property PL1 of Definition 12.1 says that adding an action to a decision history
h cannot yield a more plausible history than h itself. Property PL2 says that at
every decision history h there is at least one action a which is “plausibility
preserving” in the sense that adding a to h yields a history which is as plausible
as h; furthermore, any such action a performs the same role with any other
history that belongs to the same information set as h. Property PL3 says that all
the actions at a history assigned to chance are plausibility preserving.
Definition 12.2. Given an extensive form, an assessment (σ, μ) is
AGM-consistent if there exists a plausibility order ≾ on the set of histories H
such that:

(i) the actions that are assigned positive probability by σ are precisely the
plausibility-preserving actions: for all h ∈ D and for all a ∈ A(h),

σ(a) > 0 if and only if h ~ ha,    (P1)

(ii) the histories that are assigned positive probability by μ are precisely those
that are most plausible within the corresponding information set:
for all h ∈ D,

μ(h) > 0 if and only if h ≾ h′, for all h′ ∈ I(h).    (P2)

If ≾ satisfies properties P1 and P2 with respect to (σ, μ), we say that ≾
rationalizes (σ, μ).

In  conjunction  with  sequential  rationality,  the  notion  of  AGM-consistency  is 
sufficient  to  rule  out  some  subgame-perfect  equilibria.  Consider,  for  example, 
the extensive game of Figure 12.2 and the pure-strategy profile $\sigma = (c,d,f)$
(highlighted  by  double  edges),  which  constitutes  a Nash  equilibrium  of  the 
game  (and  also  a subgame-perfect  equilibrium  since  there  are  no  proper 
subgames). 


Figure 12.2
Can $\sigma$ be part of a sequentially rational AGM-consistent assessment $(\sigma,\mu)$?
Since, for Player 3, choice $f$ can be chosen rationally only if the player assigns
(sufficiently high) positive probability to history $be$, sequential rationality
requires that $\mu(be) > 0$; however, any such assessment is not AGM-consistent.
In fact, if there were a plausibility order $\precsim$ that satisfied Definition 12.2, then,
by P1, $b \sim bd$ (since $\sigma(d) = 1 > 0$) and $b \prec be$ (since $\sigma(e) = 0$) and, by P2,
$be \precsim bd$ (since, by hypothesis, $\mu$ assigns positive probability to $be$). By
transitivity of $\precsim$, from $b \prec be$ and $b \sim bd$ it follows that $bd \prec be$, yielding a
contradiction.

On the other hand, the Nash equilibrium $\sigma'(c) = 1$, $\sigma'(d) = \sigma'(e) = \tfrac12$,
$\sigma'(f) = \sigma'(g) = \tfrac12$ together with beliefs $\mu'(bd) = \mu'(be) = \tfrac12$ forms a sequentially
rational, AGM-consistent assessment: it can be rationalized by the plausibility
order shown in Figure 12.3, where each row represents an equivalence class ($\emptyset$
denotes the null history, that is, the root of the tree):

∅, c                             most plausible
b, bd, be, bdf, bdg, bef, beg
a, af, ag                        least plausible

Figure 12.3

We use the following convention in representing a plausibility order: if the row
to which history $h$ belongs is above the row to which $h'$ belongs, then $h \prec h'$ ($h$
is more plausible than $h'$) and if $h$ and $h'$ belong to the same row then $h \sim h'$
($h$ is as plausible as $h'$).
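Conditions P1 and P2 of Definition 12.2 are mechanical enough to be checked by a short program. The following Python sketch is an illustration added here, not part of the original text. It encodes a plausibility order as an integer rank (lower rank means more plausible, equal rank means same row) and tests the order of Figure 12.3 against the assessment $(\sigma',\mu')$. The tree structure (Player 1 chooses among a, b, c; Player 2 chooses between d and e after b; Player 3 chooses between f and g at the information set {a, bd, be}) is inferred from the histories listed in Figure 12.3, and all function and variable names are assumptions made for the example.

```python
from fractions import Fraction

def is_agm_consistent(rank, sigma, mu, actions, info_sets):
    """Check P1 and P2 of Definition 12.2 for an order given by ranks.

    rank:      dict history -> int (lower = more plausible)
    sigma:     dict action label -> probability
    mu:        dict decision history -> probability
    actions:   dict decision history -> string of available action labels
    info_sets: list of lists of decision histories
    """
    # P1: sigma(a) > 0 exactly when h and ha are equally plausible.
    for h, acts in actions.items():
        for a in acts:
            if (sigma[a] > 0) != (rank[h] == rank[h + a]):
                return False
    # P2: mu(h) > 0 exactly when h is most plausible in its information set.
    for info in info_sets:
        best = min(rank[h] for h in info)
        for h in info:
            if (mu[h] > 0) != (rank[h] == best):
                return False
    return True

half = Fraction(1, 2)
# Histories are strings of action labels; "" stands for the null history.
actions = {"": "abc", "b": "de", "a": "fg", "bd": "fg", "be": "fg"}
info_sets = [[""], ["b"], ["a", "bd", "be"]]

# Figure 12.3: top row has rank 0, middle row rank 1, bottom row rank 2.
rank = {h: 0 for h in ["", "c"]}
rank.update({h: 1 for h in ["b", "bd", "be", "bdf", "bdg", "bef", "beg"]})
rank.update({h: 2 for h in ["a", "af", "ag"]})

sigma_prime = {"a": 0, "b": 0, "c": 1, "d": half, "e": half, "f": half, "g": half}
mu_prime = {"": 1, "b": 1, "a": 0, "bd": half, "be": half}
print(is_agm_consistent(rank, sigma_prime, mu_prime, actions, info_sets))
```

The same function rejects this particular order for the pure-strategy profile $(c,d,f)$ with $\mu(be) > 0$, because $\sigma(e) = 0$ while $b$ and $be$ sit in the same row. That is only a check of one order; the argument in the text shows that no order can work.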


This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this  section,  by  going  through  the  exercises  in  Section  12.E.1  of  Appendix  12.E  at 
the  end  of  this  chapter. 


By PL1 of Definition 12.1, $b \precsim be$ and, by P1 of Definition 12.2, it is not the case that $b \sim be$
because $e$ is not assigned positive probability by $\sigma$. Thus $b \prec be$.
12.2 Bayesian consistency

The  definition  of  AGM-consistency  deals  with  the  supports  of  a given 
assessment,  that  is  with  the  actions  that  are  assigned  positive  probability  by  the 
strategy profile $\sigma$ and the histories that are assigned positive probability by the
system of beliefs $\mu$. In this sense it is a qualitative (as opposed to quantitative)
property:  how  the  probabilities  are  distributed  on  those  supports  is  irrelevant 
for  AGM-consistency.  However,  we  also  need  to  impose  quantitative 
restrictions  concerning  the  actual  probabilities.  The  reason  for  this  is  that  we 
want  the  given  assessment  to  satisfy  “Bayesian  updating  as  long  as  possible”.  By 
this  we  mean  the  following: 

(1)  when  information  causes  no  surprises,  because  the  play  of  the  game  is 
consistent  with  the  most  plausible  play(s)  (that  is,  when  information  sets  are 
reached  that  have  positive  prior  probability),  then  beliefs  should  be  formed 
using  Bayesian  updating  (Definition  8.3,  Chapter  8),  and 

(2)  when  information  is  surprising  (that  is,  when  an  information  set  is  reached 
that  had  zero  prior  probability)  then  new  beliefs  can  be  formed  in  an 
arbitrary  way,  but  from  then  on  Bayesian  updating  should  be  used  to 
update  those  new  beliefs,  whenever  further  information  is  received  that  is 
consistent  with  those  beliefs. 

The  next  definition  captures  the  above  requirements. 

Definition 12.3. Given a finite extensive form, let $\precsim$ be a plausibility order
that rationalizes the assessment $(\sigma,\mu)$. We say that $(\sigma,\mu)$ is Bayesian (or Bayes
consistent) relative to $\precsim$ if for every equivalence class $E$ of $\precsim$ that contains some
decision history $h$ with $\mu(h) > 0$ (that is, $E \cap D^+_\mu \neq \emptyset$,
where $D^+_\mu = \{h \in D : \mu(h) > 0\}$) there exists a probability distribution
$\nu_E : H \to [0,1]$ (recall that $H$ is a finite set) such that:

B1. $\nu_E(h) > 0$ if and only if $h \in E \cap D^+_\mu$.

B2. If $h, h' \in E \cap D^+_\mu$ and $h' = h a_1 \ldots a_m$ (that is, $h$ is a prefix of $h'$) then
$\nu_E(h') = \nu_E(h) \times \sigma(a_1) \times \cdots \times \sigma(a_m)$.

B3. If $h \in E \cap D^+_\mu$ then, for every $h' \in I(h)$,
\[
\mu(h') = \nu_E(h' \mid I(h)) \overset{\text{def}}{=} \frac{\nu_E(h')}{\sum_{h'' \in I(h)} \nu_E(h'')}.
\]
Property B1 requires that $\nu_E(h) > 0$ if and only if $h \in E$ and $\mu(h) > 0$.
Property B2 requires $\nu_E$ to be consistent with the strategy profile $\sigma$ in the
sense that if $h, h' \in E$, $\mu(h) > 0$, $\mu(h') > 0$ and $h' = h a_1 \ldots a_m$ then the probability
that $\nu_E$ assigns to $h'$ is equal to the probability that $\nu_E$ assigns to $h$ multiplied
by the probabilities (according to $\sigma$) of the actions that lead from $h$ to $h'$.$^{10}$
Property B3 requires the system of beliefs $\mu$ to satisfy the conditional
probability rule in the sense that if $h \in E$ and $\mu(h) > 0$ (so that $E$ is the
equivalence class of the most plausible elements of $I(h)$) then, for every history
$h' \in I(h)$, $\mu(h')$ (the probability assigned to $h'$ by $\mu$) coincides with the
probability of $h'$ conditional on $I(h)$ using the probability distribution $\nu_E$.

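As an example of an AGM-consistent and Bayesian consistent assessment,
consider the extensive form of Figure 12.4 and the assessment

\[
\sigma = \left(\begin{array}{ccc|cc|cc} a & b & c & d & e & f & g \\ 0 & 0 & 1 & 1 & 0 & \tfrac13 & \tfrac23 \end{array}\right), \qquad
\mu = \left(\begin{array}{ccc|ccc} ad & ae & b & a & bf & bg \\ \tfrac14 & 0 & \tfrac34 & \tfrac14 & \tfrac14 & \tfrac24 \end{array}\right).
\]

Figure 12.4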


10 Note that if $h, h' \in E$ and $h' = h a_1 \ldots a_m$ then $\sigma(a_j) > 0$ for all $j = 1, \ldots, m$. In fact, since $h \sim h'$,
every action $a_j$ is plausibility preserving and thus, by Property P1 of Definition 12.2,
$\sigma(a_j) > 0$.
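The assessment

\[
\sigma = \left(\begin{array}{ccc|cc|cc} a & b & c & d & e & f & g \\ 0 & 0 & 1 & 1 & 0 & \tfrac13 & \tfrac23 \end{array}\right), \qquad
\mu = \left(\begin{array}{ccc|ccc} ad & ae & b & a & bf & bg \\ \tfrac14 & 0 & \tfrac34 & \tfrac14 & \tfrac14 & \tfrac24 \end{array}\right)
\]

is AGM consistent, since it is rationalized by the plausibility order shown in
Figure 12.5.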


∅, c                                     most plausible
a, b, ad, bf, bg, adf, adg, bfd, bgd
ae, aef, aeg, bfe, bge                   least plausible

Figure 12.5

Furthermore, the assessment is Bayesian relative to this plausibility order. First
of all, note that $D^+_\mu = \{\emptyset, a, ad, b, bf, bg\}$.$^{11}$ Let $E$ be the top equivalence class
($E = \{\emptyset, c\}$), $F$ the middle one ($F = \{a, b, ad, bf, bg, adf, adg, bfd, bgd\}$) and $G$
the bottom one ($G = \{ae, aef, aeg, bfe, bge\}$). Then only $E$ and $F$ have a non-empty
intersection with $D^+_\mu$ and thus, by Definition 12.3, we only need to
specify two probability distributions: $\nu_E$ and $\nu_F$. The first one is trivial, since
$D^+_\mu \cap E = \{\emptyset\}$ and thus it must be $\nu_E(\emptyset) = 1$ (and $\nu_E(h) = 0$ for every other
history $h$). Since $D^+_\mu \cap F = \{a, ad, b, bf, bg\}$, by B1 of Definition 12.3 it must be
that $\nu_F(h) > 0$ if and only if $h \in \{a, ad, b, bf, bg\}$. Consider the following
probability distribution:


\[
\nu_F = \begin{pmatrix} a & ad & b & bf & bg \\ \tfrac18 & \tfrac18 & \tfrac38 & \tfrac18 & \tfrac28 \end{pmatrix}
\]

(and $\nu_F(h) = 0$ for every other history $h$).


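Property B2 of Definition 12.3 is satisfied, because
$\nu_F(ad) = \underbrace{\tfrac18}_{=\nu_F(a)} \times \underbrace{1}_{=\sigma(d)}$,
$\nu_F(bf) = \underbrace{\tfrac38}_{=\nu_F(b)} \times \underbrace{\tfrac13}_{=\sigma(f)}$ and
$\nu_F(bg) = \underbrace{\tfrac38}_{=\nu_F(b)} \times \underbrace{\tfrac23}_{=\sigma(g)}$.
To check that Property B3 of
Definition 12.3 is satisfied, let $I_2 = \{a, bf, bg\}$ be the information set of Player 2
and $I_3 = \{b, ad, ae\}$ be the information set of Player 3. Then $\nu_F(I_2) = \nu_F(a) +
\nu_F(bf) + \nu_F(bg) = \tfrac48$ and $\nu_F(I_3) = \nu_F(b) + \nu_F(ad) + \nu_F(ae) = \tfrac38 + \tfrac18 + 0 = \tfrac48$. Thus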


11 Recall that $D^+_\mu = \{h \in D : \mu(h) > 0\}$.
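\[
\frac{\nu_F(a)}{\nu_F(I_2)} = \frac{1/8}{4/8} = \tfrac14 = \mu(a), \qquad
\frac{\nu_F(bf)}{\nu_F(I_2)} = \frac{1/8}{4/8} = \tfrac14 = \mu(bf), \qquad
\frac{\nu_F(bg)}{\nu_F(I_2)} = \frac{2/8}{4/8} = \tfrac24 = \mu(bg),
\]
\[
\frac{\nu_F(b)}{\nu_F(I_3)} = \frac{3/8}{4/8} = \tfrac34 = \mu(b), \qquad
\frac{\nu_F(ad)}{\nu_F(I_3)} = \frac{1/8}{4/8} = \tfrac14 = \mu(ad), \qquad
\frac{\nu_F(ae)}{\nu_F(I_3)} = \frac{0}{4/8} = 0 = \mu(ae).
\]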
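The verification of B1, B2 and B3 carried out above can be repeated with exact rational arithmetic. The Python sketch below is an added illustration under stated assumptions: histories are written as strings of action labels (with "" for the null history), the information sets are those named in the text ($I_2 = \{a, bf, bg\}$ and $I_3 = \{b, ad, ae\}$), and the helper name `is_bayesian_on_class` is hypothetical.

```python
from fractions import Fraction as Fr

sigma = {"a": Fr(0), "b": Fr(0), "c": Fr(1), "d": Fr(1), "e": Fr(0),
         "f": Fr(1, 3), "g": Fr(2, 3)}
mu = {"": Fr(1), "a": Fr(1, 4), "bf": Fr(1, 4), "bg": Fr(2, 4),
      "b": Fr(3, 4), "ad": Fr(1, 4), "ae": Fr(0)}
info_sets = [[""], ["a", "bf", "bg"], ["b", "ad", "ae"]]

def is_bayesian_on_class(E, nu_E, sigma, mu, info_sets):
    """Return (B1, B2, B3) for one equivalence class E and candidate nu_E."""
    support = {h for h in E if mu.get(h, 0) > 0}   # E intersected with D+

    def nu(h):
        return nu_E.get(h, Fr(0))

    # B1: nu_E is positive exactly on the support.
    b1 = {h for h, p in nu_E.items() if p > 0} == support
    # B2: along a path inside the support, nu_E multiplies by sigma.
    b2 = True
    for h in support:
        for h2 in support:
            if h2 != h and h2.startswith(h):
                prob = nu(h)
                for a in h2[len(h):]:
                    prob *= sigma[a]
                b2 = b2 and prob == nu(h2)
    # B3: mu equals nu_E conditioned on each information set that meets the support.
    b3 = True
    for info in info_sets:
        if support & set(info):
            total = sum(nu(h) for h in info)
            b3 = b3 and all(mu[h] == nu(h) / total for h in info)
    return b1, b2, b3

E_class = ["", "c"]
F_class = ["a", "b", "ad", "bf", "bg", "adf", "adg", "bfd", "bgd"]
nu_E = {"": Fr(1)}
nu_F = {"a": Fr(1, 8), "ad": Fr(1, 8), "b": Fr(3, 8),
        "bf": Fr(1, 8), "bg": Fr(2, 8)}

print(is_bayesian_on_class(E_class, nu_E, sigma, mu, info_sets))
print(is_bayesian_on_class(F_class, nu_F, sigma, mu, info_sets))
```

Using `Fraction` rather than floating point keeps the equalities in B2 and B3 exact, which matters because the definition requires equality and not approximate agreement.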


This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this  section,  by  going  through  the  exercises  in  Section  12.E.2  of  Appendix  12.E  at 
the  end  of  this  chapter. 


12.3  Perfect  Bayesian  equilibrium 

By  adding  sequential  rationality  (Definition  10.2)  to  AGM-consistency 
(Definition  12.2)  and  Bayesian  consistency  (Definition  12.3)  we  obtain  a new 
notion  of  equilibrium  for  extensive-form  games. 


Definition 12.4. Given a finite extensive-form game, an assessment $(\sigma,\mu)$ is a
perfect  Bayesian  equilibrium  if  it  is  sequentially  rational,  it  is  rationalized  by  a 
plausibility  order  on  the  set  of  histories  and  is  Bayesian  relative  to  that 
plausibility  order. 


For an example of a perfect Bayesian equilibrium, consider the game of Figure
12.6 (which is based on the frame of Figure 12.4) and the assessment considered
in the previous section, namely

\[
\sigma = \left(\begin{array}{ccc|cc|cc} a & b & c & d & e & f & g \\ 0 & 0 & 1 & 1 & 0 & \tfrac13 & \tfrac23 \end{array}\right), \qquad
\mu = \left(\begin{array}{ccc|ccc} ad & ae & b & a & bf & bg \\ \tfrac14 & 0 & \tfrac34 & \tfrac14 & \tfrac14 & \tfrac24 \end{array}\right).
\]

Figure 12.6


We showed in the previous section that the assessment under consideration is
AGM and Bayesian consistent. Thus we only need to verify sequential
rationality. For Player 3 the expected payoff from playing $f$ is
$(\tfrac14)0 + (0)1 + (\tfrac34)1 = \tfrac34$ and the expected payoff from playing $g$ is
$(\tfrac14)3 + (0)2 + (\tfrac34)0 = \tfrac34$; thus any mixture of $f$ and $g$ is sequentially rational, in
particular the mixture $\begin{pmatrix} f & g \\ \tfrac13 & \tfrac23 \end{pmatrix}$. For Player 2 the expected payoff from playing
$d$ is $[\ldots] + (\tfrac14)2 + (\tfrac24)2 = 2$, while the expected payoff from playing $e$
is $[\ldots] + (\tfrac14)0 + (\tfrac24)1 = [\ldots]$; thus $d$ is sequentially rational. For Player 1, $c$
gives a payoff of 2 while $a$ gives a payoff of $[\ldots]$ and $b$ gives a payoff of $[\ldots]$; thus $c$
is sequentially rational.


We  now  turn  to  the  properties  of  Perfect  Bayesian  equilibria. 


Theorem 12.1 (Bonanno, 2013). Consider a finite extensive-form game and an
assessment $(\sigma,\mu)$. If $(\sigma,\mu)$ is a perfect Bayesian equilibrium then (1) $\sigma$ is a
subgame-perfect equilibrium and (2) $(\sigma,\mu)$ is a weak sequential equilibrium.
The example of Figure 12.2 (Section 12.1) showed that not every subgame-perfect
equilibrium can be part of a perfect Bayesian equilibrium. Thus, by
Theorem 12.1, the notion of perfect Bayesian equilibrium is a strict refinement
of the notion of subgame-perfect equilibrium.


Theorem 12.2 (Bonanno, 2013). Consider a finite extensive-form game and an
assessment $(\sigma,\mu)$. If $(\sigma,\mu)$ is a sequential equilibrium then $(\sigma,\mu)$ is a perfect
Bayesian equilibrium.

Not  every  perfect  Bayesian  equilibrium  is  a sequential  equilibrium.  To  see  this, 
consider  the  game  of  Figure  12.7. 


Figure 12.7

A perfect Bayesian equilibrium is given by the pure-strategy profile
$\sigma = (c,d,g)$ (highlighted by double edges), together with the degenerate beliefs
$\mu(a) = \mu(be) = 1$. In fact, $(\sigma,\mu)$ is sequentially rational and, furthermore, it is
rationalized by the plausibility order shown in Figure 12.8 and is Bayesian
relative to it (the probability distributions on the equivalence classes that
contain histories $h$ with $\mu(h) > 0$ are written next to the order):
∅, c       most plausible      $\nu_{\{\emptyset,c\}}(\emptyset) = 1$
a, ad                          $\nu_{\{a,ad\}}(a) = 1$
b, bd                          -
be, beg                        $\nu_{\{be,beg\}}(be) = 1$
ae, aeg                        -
bef                            -
aef        least plausible     -

Figure 12.8


The belief revision policy encoded in a perfect Bayesian equilibrium can be
interpreted either as the point of view of an external observer or as a belief
revision policy which is shared by all the players. For example, the perfect
Bayesian equilibrium under consideration (for the game of Figure 12.7), namely
$\sigma = (c,d,g)$ and $\mu(a) = \mu(be) = 1$, reflects the following belief revision policy:
the initial beliefs are that Player 1 will play $c$; conditional on learning that
Player 1 did not play $c$, the observer would become convinced that Player 1
played $a$ (that is, she would judge $a$ to be strictly more plausible than $b$) and
would expect Player 2 to play $d$; upon learning that Player 1 did not play $c$ and
Player 2 did not play $d$, the observer would become convinced that Player 1
played $b$ and Player 2 played $e$, hence judging $be$ to be strictly more plausible
than $ae$, thereby reversing her earlier belief that $a$ was strictly more plausible
than $b$. Such a belief revision policy is consistent with the AGM rationality
axioms (Alchourrón et al., 1985) but is incompatible with the notion of
sequential equilibrium. In fact, $(\sigma,\mu)$ is not KW-consistent (Definition 11.1,
Chapter 11). To see this, consider an arbitrary sequence $\langle \sigma_n \rangle_{n=1,2,\ldots}$ that
converges to $\sigma$:

\[
\sigma_n = \left(\begin{array}{ccc|cc|cc} a & b & c & d & e & f & g \\ p_n & q_n & 1 - p_n - q_n & 1 - r_n & r_n & t_n & 1 - t_n \end{array}\right)
\]

with $\lim_{n\to\infty} p_n = \lim_{n\to\infty} q_n = \lim_{n\to\infty} r_n = \lim_{n\to\infty} t_n = 0$. Then the corresponding sequence of
beliefs obtained by Bayesian updating is given by

\[
\mu_n = \left(\begin{array}{cc|cc} a & b & ae & be \\[2pt]
\dfrac{p_n}{p_n + q_n} & \dfrac{q_n}{p_n + q_n} & \dfrac{p_n r_n}{p_n r_n + q_n r_n} = \dfrac{p_n}{p_n + q_n} & \dfrac{q_n r_n}{p_n r_n + q_n r_n} = \dfrac{q_n}{p_n + q_n} \end{array}\right).
\]



GAME  THEORY  - Giacomo  Bonanno 


Thus, $\mu_n(a) = \mu_n(ae)$; therefore if $\lim_{n\to\infty} \mu_n(a) = \mu(a) = 1$ then
$\lim_{n\to\infty} \mu_n(ae) = 1 \neq \mu(ae) = 0$.
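The key step in this argument is that the tremble $r_n$ of Player 2 cancels out of the Bayesian update, so the belief on $ae$ always equals the belief on $a$. The short Python sketch below, added here as an illustration, confirms the cancellation with exact fractions for one particular sequence. The sequence $p_n = 1/(n+2)$, $q_n = 1/(n+2)^2$, $r_n = 1/(n+3)$ is an arbitrary choice made for the example and is not taken from the text.

```python
from fractions import Fraction as Fr

def updated_beliefs(p, q, r):
    """Bayes' rule under a completely mixed profile with
    sigma_n(a) = p, sigma_n(b) = q and sigma_n(e) = r."""
    mu_a = p / (p + q)                    # belief on a within {a, b}
    mu_ae = (p * r) / (p * r + q * r)     # belief on ae within {ae, be}
    return mu_a, mu_ae

beliefs = []
for n in range(1, 8):
    p, q, r = Fr(1, n + 2), Fr(1, (n + 2) ** 2), Fr(1, n + 3)
    beliefs.append(updated_beliefs(p, q, r))

for mu_a, mu_ae in beliefs:
    print(mu_a, mu_ae)   # the two entries coincide for every n
```

With this choice $p_n/q_n$ grows without bound, so $\mu_n(a)$ tends to 1 and $\mu_n(ae)$ is dragged along with it, which is why the limit cannot assign probability 1 to $be$.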


By  Theorem  11.2  (Chapter  11),  every  finite  extensive-form  game  with  cardinal 
payoffs  has  at  least  one  sequential  equilibrium  and,  by  Theorem  12.2,  every 
sequential  equilibrium  is  a perfect  Bayesian  equilibrium.  Thus  the  following 
theorem  follows  as  a corollary  of  these  two  results. 

Theorem  12.3.  Every  finite  extensive-form  game  with  cardinal  payoffs  has  at 
least  one  perfect  Bayesian  equilibrium. 

The  relationship  among  the  different  notions  of  equilibrium  introduced  so  far 
(Nash  equilibrium,  subgame-perfect  equilibrium,  weak  sequential  equilibrium, 
perfect  Bayesian  equilibrium  and  sequential  equilibrium)  is  shown  in  the  Venn 
diagram  of  Figure  12.9. 


[Venn diagram with regions labeled: Nash equilibria; Subgame-perfect equilibria;
Weak sequential equilibria; Perfect Bayesian equilibria; Sequential equilibria]

Figure 12.9

This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this  section,  by  going  through  the  exercises  in  Section  12.E.3  of  Appendix  12.E  at 
the  end  of  this  chapter. 
12.4 Adding independence

The  notion  of  perfect  Bayesian  equilibrium  imposes  relatively  mild  restrictions 
on  beliefs  at  information  sets  that  are  reached  with  zero  probability.  Those 
restrictions require consistency between the strategy profile $\sigma$ and the system
of beliefs $\mu$, as well as the requirement of Bayesian updating “as long as possible”
(that  is,  also  after  beliefs  have  been  revised  at  unreached  information  sets).  The 
example  of  Figure  12.7  showed  that  perfect  Bayesian  equilibrium  is  compatible 
with  a belief  revision  policy  that  allows  a reversal  of  judgment  concerning  the 
behavior  of  one  player  after  observing  an  unexpected  move  by  a different 
player.  One  might  want  to  rule  out  such  forms  of  belief  revision.  In  this  section 
we  introduce  and  discuss  further  restrictions  on  belief  revision  that  incorporate 
some  form  of  independence. 

Situations like the one illustrated in Figure 12.7 are ruled out by the following
restriction on the plausibility order: if $h$ and $h'$ belong to the same information
set ($h' \in I(h)$) and $a$ is an action available at $h$ ($a \in A(h)$), then

$h \precsim h'$ if and only if $ha \precsim h'a$. ($IND_1$)

$IND_1$ says that if $h$ is deemed to be at least as plausible as $h'$, then the addition
of any action $a$ must preserve this judgment, in the sense that $ha$ must be
deemed to be at least as plausible as $h'a$, and vice versa. This restriction can be
viewed as an independence condition, in the sense that observation of a new
action cannot lead to a change in the relative plausibility of prior histories. Any
plausibility order that rationalizes the assessment discussed for the game shown
in Figure 12.7 violates $IND_1$, since it must be such that $a \prec b$ and also $be \prec ae$.

Another independence condition is the following, which says that if,
conditional on history $h$, action $a$ is implicitly judged to be at least as plausible
as action $b$ then the same judgment must be made conditional on any other
history that belongs to the same information set as $h$: if $h' \in I(h)$
and $a, b \in A(h)$, then

$ha \precsim hb$ if and only if $h'a \precsim h'b$. ($IND_2$)


Remark 12.1. The two properties $IND_1$ and $IND_2$ are independent of each
other, in the sense that there are plausibility orders that satisfy one of the two
properties but not the other (see Exercises 12.7 and 12.8).
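Both independence conditions compare ranks of histories before and after a common action, so they are easy to test on a concrete order. The Python sketch below is an added illustration, not part of the original text. It uses the plausibility order of Figure 12.8 for the game of Figure 12.7, encoded as integer ranks (lower means more plausible). The tree structure (Player 1 chooses among a, b, c; Player 2 chooses between d and e at {a, b}; Player 3 chooses between f and g at {ae, be}) is inferred from the histories listed in that figure, and the function names are assumptions.

```python
def satisfies_ind1(rank, info_sets, actions):
    """IND_1: h <= h' in plausibility iff ha <= h'a, for h, h' in one information set."""
    return all((rank[h] <= rank[h2]) == (rank[h + a] <= rank[h2 + a])
               for info in info_sets for h in info for h2 in info
               for a in actions[h])

def satisfies_ind2(rank, info_sets, actions):
    """IND_2: ha <= hb in plausibility iff h'a <= h'b, for h, h' in one information set."""
    return all((rank[h + a] <= rank[h + b]) == (rank[h2 + a] <= rank[h2 + b])
               for info in info_sets for h in info for h2 in info
               for a in actions[h] for b in actions[h])

# Histories as strings of action labels; "" is the null history.
actions = {"": "abc", "a": "de", "b": "de", "ae": "fg", "be": "fg"}
info_sets = [[""], ["a", "b"], ["ae", "be"]]

# Figure 12.8, one rank per row, top row first.
rows = [["", "c"], ["a", "ad"], ["b", "bd"], ["be", "beg"],
        ["ae", "aeg"], ["bef"], ["aef"]]
rank_12_8 = {h: k for k, row in enumerate(rows) for h in row}

print(satisfies_ind1(rank_12_8, info_sets, actions))
print(satisfies_ind2(rank_12_8, info_sets, actions))
```

The first check fails on the pair $a$, $b$ with action $e$, which is exactly the reversal described in the text ($a \prec b$ but $be \prec ae$).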
Property $IND_1$ is a qualitative property, to which we can add a corresponding
quantitative condition on the probabilities: we call a system of beliefs $\mu$ weakly
independent if it satisfies the following condition (recall that
$D^+_\mu = \{h \in D : \mu(h) > 0\}$):

if $h' \in I(h)$, $a \in A(h)$, $h'a \in I(ha)$ and $\{h, h', ha, h'a\} \subseteq D^+_\mu$ then

\[
\frac{\mu(h)}{\mu(h')} = \frac{\mu(ha)}{\mu(h'a)}. \qquad (IND_3)
\]


The game illustrated in Figure 12.10 below shows that it is possible for an
assessment $(\sigma,\mu)$ to be rationalized by a plausibility order that satisfies
Property $IND_1$, even though $\mu$ fails to satisfy Property $IND_3$.


Figure  12.10 


Consider the assessment consisting of the pure-strategy profile $\sigma = (c,d,g)$
(highlighted by double edges) and the system of beliefs
$\mu = \left(\begin{array}{cc|cc} a & b & ae & be \\ \tfrac34 & \tfrac14 & \tfrac14 & \tfrac34 \end{array}\right)$.
This assessment is rationalized by the plausibility order
∅, c                most plausible
a, b, ad, bd
ae, be, aeg, beg
aef, bef            least plausible

which satisfies Property $IND_1$, since $a \sim b$ and $ad \sim bd$ and $ae \sim be$. However,
$\mu$ fails to satisfy Property $IND_3$ since $\{a, b, ae, be\} \subseteq D^+_\mu$ and

\[
\frac{\mu(a)}{\mu(b)} = 3 \neq \frac{\mu(ae)}{\mu(be)} = \tfrac13.
\]
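The failure of $IND_3$ in this example can be confirmed numerically. The Python sketch below is an added illustration: it lists every triple $(h, h', a)$ at which the ratio condition breaks, using the beliefs $\mu(a) = \tfrac34$, $\mu(b) = \tfrac14$, $\mu(ae) = \tfrac14$, $\mu(be) = \tfrac34$ of the assessment for Figure 12.10. The information sets {a, b} and {ae, be} and the action labels are inferred from the histories in the plausibility order above, and the function name is hypothetical.

```python
from fractions import Fraction as Fr

def violations_of_ind3(mu, info_sets, actions):
    """Return the triples (h, h', a) at which IND_3 fails."""
    info_of = {h: frozenset(info) for info in info_sets for h in info}
    bad = []
    for info in info_sets:
        for h in info:
            for h2 in info:
                for a in actions[h]:
                    ha, h2a = h + a, h2 + a
                    # h'a must be a decision history in the information set of ha
                    if ha not in info_of or info_of[ha] != info_of.get(h2a):
                        continue
                    # all four histories must receive positive probability
                    if min(mu[h], mu[h2], mu[ha], mu[h2a]) <= 0:
                        continue
                    if mu[h] / mu[h2] != mu[ha] / mu[h2a]:
                        bad.append((h, h2, a))
    return bad

actions = {"": "abc", "a": "de", "b": "de", "ae": "fg", "be": "fg"}
info_sets = [[""], ["a", "b"], ["ae", "be"]]
mu = {"": Fr(1), "a": Fr(3, 4), "b": Fr(1, 4), "ae": Fr(1, 4), "be": Fr(3, 4)}

print(violations_of_ind3(mu, info_sets, actions))
```

Swapping the last two beliefs, so that $\mu(ae) = \tfrac34$ and $\mu(be) = \tfrac14$, restores the equality of ratios and the list of violations becomes empty.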


Definition 12.5. Given a finite extensive-form game, an assessment $(\sigma,\mu)$ is
an independent perfect Bayesian equilibrium if it is a perfect Bayesian
equilibrium that is rationalized by a plausibility order that satisfies properties
$IND_1$ and $IND_2$ and, furthermore, $\mu$ satisfies property $IND_3$.


The following theorem is proved in Appendix 12.B at the end of this chapter,
using a result given in the next section.


Theorem 12.4. Consider a finite extensive-form game and an assessment
$(\sigma,\mu)$. If $(\sigma,\mu)$ is a sequential equilibrium then it is an independent perfect
Bayesian equilibrium.


Exercise  12.10  shows  that  not  every  independent  perfect  Bayesian  equilibrium 
is  a sequential  equilibrium.  Thus  the  notion  of  sequential  equilibrium  is  a strict 
refinement  of  the  notion  of  independent  perfect  Bayesian  equilibrium,  which  - 
in  turn  - is  a strict  refinement  of  perfect  Bayesian  equilibrium,  as  illustrated  in 
the  Venn  diagram  of  Figure  12.11  below. 
[Venn diagram; outermost region: Perfect Bayesian equilibria]

Figure 12.11


This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this section, by going through the exercises in Section 12.E.4 of Appendix 12.E at
the  end  of  this  chapter. 


12.5  Filling  the  gap  between  perfect  Bayesian 
equilibrium  and  sequential  equilibrium 

Besides  sequential  rationality,  the  notion  of  perfect  Bayesian  equilibrium 
(Definition  12.4)  is  based  on  two  elements:  (1)  the  qualitative  notions  of 
plausibility  order  and  AGM-consistency  and  (2)  the  notion  of  Bayesian 
consistency  relative  to  the  plausibility  order.  In  this  section  we  will  show  that 
by  strengthening  these  two  conditions  one  obtains  a characterization  of 
sequential  equilibrium.  The  strengthening  of  the  first  condition  is  that  the 
plausibility  order  that  rationalizes  the  given  assessment  has  a “cardinal” 
numerical  representation  that  can  be  interpreted  as  measuring  the  plausibility 
distance  between  histories  in  a way  that  is  preserved  by  the  addition  of  a 
common  action.  The  strengthening  of  the  second  condition  imposes  uniform 
consistency  on  the  conditional  probability  distributions  on  the  equivalence 
classes  of  the  plausibility  order,  by  requiring  that  the  relative  probabilities  of 
two  decision  histories  in  the  same  information  set  be  preserved  when  a 
common  action  is  added. 
Definition 12.6. Given a finite extensive form and a plausibility order $\precsim$ on
the set of histories $H$, a function $F : H \to \mathbb{N}$ (where $\mathbb{N}$ denotes the set of
non-negative integers) is an integer-valued representation of $\precsim$ if, for all $h, h' \in H$,

$F(h) \le F(h')$ if and only if $h \precsim h'$.


An  integer-valued  representation  F of  a plausibility  order  is  analogous  to  an 
ordinal  utility  function  for  a preference  relation:  it  is  just  a numerical 
representation  of  the  order.  Note  that,  in  the  case  of  a plausibility  order,  we 
find  it  more  convenient  to  assign  lower  values  to  more  plausible  histories  (while 
a utility  function  assigns  higher  values  to  more  preferred  alternatives  or 
outcomes). 


Remark 12.2. If $F : H \to \mathbb{N}$ is an integer-valued representation of a plausibility
order $\precsim$ on the set of histories $H$, without loss of generality we can assume that
$F(\emptyset) = 0$ (recall that $\emptyset$ denotes the null history, which is always one of the
most plausible histories).$^{12}$

Remark 12.3. Since $H$ is a finite set, an integer-valued representation of a
plausibility order $\precsim$ on $H$ always exists. A natural integer-valued
representation is the following. Define

$H_0 = \{h \in H : h \precsim x, \text{ for all } x \in H\}$

(thus $H_0$ is the set of most plausible histories in $H$), and

$H_1 = \{h \in H \setminus H_0 : h \precsim x, \text{ for all } x \in H \setminus H_0\}$

(thus $H_1$ is the set of most plausible histories among the ones that remain after
removing the set $H_0$ from $H$) and, in general, for every integer $k \ge 1$ define
$H_k = \{h \in H \setminus (H_0 \cup \ldots \cup H_{k-1}) : h \precsim x, \text{ for all } x \in H \setminus (H_0 \cup \ldots \cup H_{k-1})\}$. Since $H$
is finite, there is an $m \in \mathbb{N}$ such that $\{H_0, \ldots, H_m\}$ is a partition of $H$ and, for
every $j, k \in \mathbb{N}$, with $j < k \le m$, and for every $h, h' \in H$, if $h \in H_j$ and $h' \in H_k$
then $h \prec h'$. Define $\hat F : H \to \mathbb{N}$ as follows: $\hat F(h) = k$ if and only if $h \in H_k$; then
the function $\hat F$ so defined is an integer-valued representation of $\precsim$ and
$\hat F(\emptyset) = 0$.

12 Let $\bar F : H \to \mathbb{N}$ be an integer-valued representation of a plausibility order $\precsim$ and define
$F : H \to \mathbb{N}$ as follows: $F(h) = \bar F(h) - \bar F(\emptyset)$. Then $F$ is also an integer-valued representation of
$\precsim$ and $F(\emptyset) = 0$.
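The layer-by-layer construction of Remark 12.3 translates directly into a loop that repeatedly peels off the most plausible remaining histories. The Python sketch below is an added illustration under stated assumptions: the plausibility order is supplied as a comparison function `at_least_as_plausible(h, x)`, the name `canonical_representation` is hypothetical, and the example order is the one on nine histories that appears in Figure 12.13, fed in through a representation with a gap in its values.

```python
def canonical_representation(histories, at_least_as_plausible):
    """Build the representation of Remark 12.3: value k for histories in layer H_k."""
    remaining = set(histories)
    F_hat, k = {}, 0
    while remaining:
        # H_k: histories at least as plausible as everything still remaining
        layer = {h for h in remaining
                 if all(at_least_as_plausible(h, x) for x in remaining)}
        if not layer:
            raise ValueError("relation is not a complete and transitive order")
        for h in layer:
            F_hat[h] = k
        remaining -= layer
        k += 1
    return F_hat

# A representation that skips the value 2 ("" is the null history).
F = {"": 0, "a": 0, "b": 1, "be": 1, "bf": 3, "c": 4, "ce": 4, "d": 5, "cf": 6}
F_hat = canonical_representation(F, lambda h, x: F[h] <= F[x])
print(sorted(F_hat.items(), key=lambda item: item[1]))
```

Because each pass removes one whole equivalence class, the resulting values are consecutive integers starting at 0, regardless of any gaps in the representation used to define the comparison.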

Definition 12.7. Given a finite extensive form, a plausibility order $\precsim$ on the
set of histories $H$ is choice measurable if it has at least one integer-valued
representation $F$ that satisfies the following property: for all $h \in D$, for all
$h' \in I(h)$ and for all $a \in A(h)$,$^{13}$

$F(h) - F(h') = F(ha) - F(h'a)$. (CM)


If  we  think  of  F as  measuring  the  “plausibility  distance”  between  histories,  then 
we  can  interpret  choice  measurability  (CM)  as  a distance-preserving  condition: 
the  plausibility  distance  between  two  histories  in  the  same  information  set  is 
preserved  by  the  addition  of  the  same  action. 

For  example,  consider  the  extensive  form  of  Figure  12.12. 


Figure  12.12 


The assessment consisting of the pure-strategy profile $\sigma = (a,e)$ (highlighted by
double edges) and the system of beliefs $\mu = \begin{pmatrix} b & c \\ 1 & 0 \end{pmatrix}$ (the grey node in Player 2's
information set represents the history which is assigned zero probability) is
rationalized by the plausibility order shown in Figure 12.13 below together
with two integer-valued representations ($\hat F$, listed in the first numerical
column, is the one described in Remark 12.3). While $\hat F$ does not satisfy
Property CM (since $\hat F(c) - \hat F(b) = 3 - 1 = 2 \neq \hat F(cf) - \hat F(bf) = 5 - 2 = 3$),
$F$ does satisfy Property CM. Since there is at least
one integer-valued representation that satisfies Property CM, by Definition
12.7 the plausibility order of Figure 12.13 is choice measurable.

13 Recall that $D$ is the set of decision histories, $I(h)$ the information set that contains decision
history $h$ and $A(h)$ the set of actions available at decision history $h$.


≾          $\hat F$    $F$
∅, a       0     0
b, be      1     1
bf         2     3
c, ce      3     4
d          4     5
cf         5     6

Figure 12.13
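Property CM is a finite list of equalities, one for each pair of histories in an information set and each action, so a given representation can be tested directly. The Python sketch below is an added illustration of that test on the two columns of Figure 12.13. It assumes, as the histories in the figure suggest, that Player 1 chooses among a, b, c, d and that Player 2 chooses between e and f at the information set {b, c}; the function name `satisfies_cm` is hypothetical.

```python
def satisfies_cm(F, info_sets, actions):
    """Check F(h) - F(h') == F(ha) - F(h'a) within every information set."""
    return all(F[h] - F[h2] == F[h + a] - F[h2 + a]
               for info in info_sets for h in info for h2 in info
               for a in actions[h])

# Histories as strings of action labels; "" is the null history.
actions = {"": "abcd", "b": "ef", "c": "ef"}
info_sets = [[""], ["b", "c"]]

# First numerical column of Figure 12.13 (the representation of Remark 12.3).
F_hat = {"": 0, "a": 0, "b": 1, "be": 1, "bf": 2, "c": 3, "ce": 3, "d": 4, "cf": 5}
# Second numerical column of Figure 12.13.
F = {"": 0, "a": 0, "b": 1, "be": 1, "bf": 3, "c": 4, "ce": 4, "d": 5, "cf": 6}

print(satisfies_cm(F_hat, info_sets, actions))
print(satisfies_cm(F, info_sets, actions))
```

Note that a failed check for one representation says nothing about choice measurability of the order itself, since Definition 12.7 only asks for the existence of some representation that passes.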

On the other hand, the plausibility order shown in Figure 12.14 (which
reproduces Figure 12.8 and refers to the game of Figure 12.7) is not choice
measurable, because any integer-valued representation $F$ must be such that
$F(b) - F(a) > 0$ while $F(be) - F(ae) < 0$.

∅, c
a, ad
b, bd
be, beg
ae, aeg
bef
aef

Figure 12.14
Rationalizability by a choice measurable plausibility order is one of the two
properties that are needed to fill the gap between perfect Bayesian equilibrium
and sequential equilibrium. The second property is a strengthening of the
notion of Bayesian consistency (Definition 12.3).

To set the stage for the next definition, let $(\sigma,\mu)$ be an assessment which is
rationalized by a plausibility order $\precsim$. As before, let $D^+_\mu$ be the set of decision
histories to which $\mu$ assigns positive probability: $D^+_\mu = \{h \in D : \mu(h) > 0\}$. Let
$\mathcal{E}^+_\mu$ be the set of equivalence classes of $\precsim$ that have a non-empty intersection
with $D^+_\mu$. Then $\mathcal{E}^+_\mu$ is a non-empty, finite set. Suppose that $(\sigma,\mu)$ is Bayesian
relative to $\precsim$ (Definition 12.3) and let $\{\nu_E\}_{E \in \mathcal{E}^+_\mu}$ be a collection of probability
distributions that satisfy the properties of Definition 12.3. We call a probability
distribution $\nu : D \to (0,1]$ (recall that $D$ is the set of all decision histories) a
full-support common prior of $\{\nu_E\}_{E \in \mathcal{E}^+_\mu}$ if, for every $E \in \mathcal{E}^+_\mu$, the probability
distribution $\nu_E$ coincides with the distribution obtained by conditioning $\nu$ on
the set $E \cap D^+_\mu$, that is, for all $h \in E \cap D^+_\mu$,

\[
\nu_E(h) = \nu(h \mid E \cap D^+_\mu) \overset{\text{def}}{=} \frac{\nu(h)}{\sum_{h' \in E \cap D^+_\mu} \nu(h')}.
\]

Note that a full-support common prior $\nu$ assigns positive probability to all
decision histories, not only to those in $D^+_\mu$.

Remark 12.4. Since any two elements of $\mathcal{E}^+_\mu$ are mutually disjoint, a
full-support common prior always exists; indeed, there is an infinite number of
them. Furthermore, it can be shown (see Bonanno, 2015) that there always
exists a full-support common prior $\nu$ that satisfies the following properties: if
$a \in A(h)$ and $ha \in D$, then (1) $\nu(ha) \le \nu(h)$ and (2) if $\sigma(a) > 0$ then
$\nu(ha) = \nu(h) \times \sigma(a)$.

The following definition requires that, among the many full-support common
priors, there be one that, besides the properties of Remark 12.4, satisfies the
additional property that the relative likelihood of any two histories in the same
information set be preserved by the addition of the same action.
Definition 12.8. Consider an extensive form. Let $(\sigma,\mu)$ be an assessment that
is rationalized by a plausibility order $\precsim$ on the set of histories $H$ and is
Bayesian relative to it and let $\{\nu_E\}_{E \in \mathcal{E}^+_\mu}$ be a collection of probability
distributions that satisfy the properties of Definition 12.3. We say that $(\sigma,\mu)$ is
uniformly Bayesian relative to $\precsim$ if there exists a full-support common prior
$\nu : D \to (0,1]$ of $\{\nu_E\}_{E \in \mathcal{E}^+_\mu}$ that satisfies the following properties:

UB1. If $a \in A(h)$ and $ha \in D$ then (1) $\nu(ha) \le \nu(h)$ and (2) if $\sigma(a) > 0$ then
$\nu(ha) = \nu(h) \times \sigma(a)$.

UB2. If $a \in A(h)$, $h$ and $h'$ belong to the same information set and $ha, h'a \in D$
then
\[
\frac{\nu(h)}{\nu(h')} = \frac{\nu(ha)}{\nu(h'a)}.
\]

UB1 is the property of Remark 12.4, which can always be satisfied by an
appropriate choice of a full-support common prior. UB2 requires that the
relative probability, according to the common prior $\nu$, of any two histories
that belong to the same information set remain unchanged by the addition of
the same action.
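Condition UB2 can be tested for a candidate common prior by cross-multiplying the two ratios, which avoids division. The Python sketch below is an added illustration, not part of the original text. It uses the game frame shared by Figures 12.7 and 12.10 as inferred from their histories (decision histories ∅, a, b, ae, be, with information sets {a, b} and {ae, be}); the function name `satisfies_ub2` and the second prior are assumptions made for the example.

```python
from fractions import Fraction as Fr

def satisfies_ub2(nu, info_sets, actions):
    """UB2: nu(h)/nu(h') == nu(ha)/nu(h'a) whenever ha and h'a are decision histories."""
    for info in info_sets:
        for h in info:
            for h2 in info:
                for a in actions[h]:
                    ha, h2a = h + a, h2 + a
                    if ha in nu and h2a in nu:   # nu is defined exactly on D
                        # cross-multiplied form of the ratio equality (nu > 0)
                        if nu[h] * nu[h2a] != nu[h2] * nu[ha]:
                            return False
    return True

actions = {"": "abc", "a": "de", "b": "de", "ae": "fg", "be": "fg"}
info_sets = [[""], ["a", "b"], ["ae", "be"]]

# Uniform prior over D = {null history, a, b, ae, be}.
uniform = {h: Fr(1, 5) for h in ["", "a", "b", "ae", "be"]}

# A prior with nu(a)/nu(b) = 3 and nu(ae)/nu(be) = 1/3, the ratios that the
# beliefs mu(a) = 3/4, mu(b) = 1/4, mu(ae) = 1/4, mu(be) = 3/4 impose.
skewed = {"": Fr(1, 9), "a": Fr(3, 9), "b": Fr(1, 9), "ae": Fr(1, 9), "be": Fr(3, 9)}

print(satisfies_ub2(uniform, info_sets, actions))
print(satisfies_ub2(skewed, info_sets, actions))
```

The second prior is only one member of the family of priors with those ratios; any other member fails UB2 for the same reason, since the test depends only on the two ratios.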


Choice measurability and uniform Bayesian consistency are independent
properties. For example, the perfect Bayesian equilibrium $\sigma = (c,d,g)$ and
$\mu(a) = \mu(be) = 1$ of the game of Figure 12.7 is such that any plausibility order
that rationalizes it cannot be choice measurable$^{14}$ and yet $(\sigma,\mu)$ is uniformly
Bayesian relative to the plausibility order shown in Figure 12.8 that rationalizes
it.$^{15}$ On the other hand, the perfect Bayesian equilibrium $\sigma = (c,d,g)$,
$\mu = \left(\begin{array}{cc|cc} a & b & ae & be \\ \tfrac34 & \tfrac14 & \tfrac14 & \tfrac34 \end{array}\right)$
of the game of Figure 12.10 is rationalized by the choice
measurable plausibility order

∅, c                most plausible
a, b, ad, bd
ae, be, aeg, beg
aef, bef            least plausible

but it cannot be uniformly Bayesian relative to any plausibility order that
rationalizes it.$^{16}$

14 Because, by P2 of Definition 12.2, any such plausibility order $\precsim$ would have to satisfy $a \prec b$
and $be \prec ae$, so that any integer-valued representation $F$ of it would be such that
$F(b) - F(a) > 0$ and $F(be) - F(ae) < 0$.

15 As can be seen by taking $\nu$ to be the uniform distribution over the set $D = \{\emptyset, a, b, ae, be\}$
(that is, $\nu(h) = \tfrac15$ for every $h \in D$): UB1 is clearly satisfied and UB2 is also satisfied, since
$\frac{\nu(a)}{\nu(b)} = \frac{1/5}{1/5} = \frac{\nu(ae)}{\nu(be)}$.


We  can  now  state  the  main  result  of  this  section,  namely  that  choice 
measurability  and  uniform  Bayesian  consistency  are  necessary  and  sufficient  for 
a perfect  Bayesian  equilibrium  to  be  a sequential  equilibrium. 


Theorem 12.5 (Bonanno, 2015). Consider a finite extensive-form game and an
assessment $(\sigma,\mu)$. The following are equivalent:

(A) $(\sigma,\mu)$ is a sequential equilibrium,

(B) $(\sigma,\mu)$ is a perfect Bayesian equilibrium that is rationalized by a
choice-measurable plausibility order and is uniformly Bayesian relative to it.


Thus  Theorem  12.5  provides  a characterization  (or  understanding)  of  sequential 
equilibrium  which  is  free  of  the  questionable  requirement  of  taking  a limit  of 
sequences  of  completely  mixed  strategies  and  associated  systems  of  beliefs  (see 
the  remarks  in  Chapter  11,  Section  11.3). 

Using  Theorem  12.5  one  can  prove  Theorem  12.4,  namely  that  a sequential 
equilibrium  is  an  independent  perfect  Bayesian  equilibrium.  The  proof  is  given 
in  Appendix  B at  the  end  of  this  chapter. 

As  an  application  of  Theorem  12.5,  consider  the  extensive-form  game  of  Figure 
12.15  below. 


16 Because, by P2 of Definition 12.2, any such plausibility order would have to have the
following equivalence classes: $E = \{a,b,ad,bd\}$ and $F = \{ae,be,aeg,beg\}$ (thus
$E \cap D^+_\mu = \{a,b\}$ and $F \cap D^+_\mu = \{ae,be\}$). Hence, if $\nu$ is any common prior then
$\nu_E(a) = \dfrac{\nu(a)}{\nu(a)+\nu(b)}$ and $\nu_E(b) = \dfrac{\nu(b)}{\nu(a)+\nu(b)}$. By B3 of Definition 12.3,
$\mu(a) = \dfrac{\nu_E(a)}{\nu_E(a)+\nu_E(b)}$ and $\mu(b) = \dfrac{\nu_E(b)}{\nu_E(a)+\nu_E(b)}$. Thus
$\dfrac{\nu(a)}{\nu(b)} = \dfrac{\nu_E(a)}{\nu_E(b)} = \dfrac{\mu(a)}{\mu(b)} = 3$ and, similarly,
$\dfrac{\nu(ae)}{\nu(be)} = \dfrac{\nu_F(ae)}{\nu_F(be)} = \dfrac{\mu(ae)}{\mu(be)} = \dfrac{1}{3}$, yielding a violation of UB2 of Definition 12.8.
Figure 12.15


Using Theorem 12.5 let us show that the assessment $(\sigma,\mu)$ with
$\sigma = (a,T,f,L)$ and
$\mu = \begin{pmatrix} b & c & & bB & cBf & d \\ \tfrac{7}{10} & \tfrac{3}{10} & & \tfrac{7}{18} & \tfrac{3}{18} & \tfrac{8}{18} \end{pmatrix}$
is a sequential equilibrium.


First we check sequential rationality. For Player 1 at the root the possible
payoffs are: 1 with a, 0 with b, 0 with c (more precisely, with either ce or cf) and
0 with d; thus a is sequentially rational. For Player 1 at history cB the possible
payoffs are: 1 with e and 2 with f; thus f is sequentially rational. For Player 2
the possible payoffs are: $\tfrac{7}{10}(3) + \tfrac{3}{10}(0) = \tfrac{21}{10}$ with T and
$\tfrac{7}{10}(0) + \tfrac{3}{10}(3) = \tfrac{9}{10}$ with B; thus T is sequentially rational. For Player 3
the possible payoffs are: $\tfrac{7}{18}(1) + \tfrac{3}{18}(0) + \tfrac{8}{18}(1) = \tfrac{15}{18}$ with L and
$\tfrac{7}{18}(0) + \tfrac{3}{18}(1) + \tfrac{8}{18}(0) = \tfrac{3}{18}$ with R; thus L is
sequentially rational. By Theorem 12.5 it only remains to show that $(\sigma,\mu)$ is
rationalized by a choice-measurable plausibility order and is uniformly
Bayesian relative to it. Indeed, $(\sigma,\mu)$ is rationalized by the plausibility order
shown in Figure 12.16 together with a choice-measurable integer-valued
representation F.
$\precsim$ :                                F :

$\emptyset$, a                              0
b, c, bT, cT                      1
d, bB, cB, cBf, dL, bBL, cBfL     2
dR, bBR, cBfR, cBe                3

Figure 12.16

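To see that $(\sigma,\mu)$ is uniformly Bayesian relative to this plausibility order, let
$E_1$, $E_2$ and $E_3$ be the top three equivalence classes of the order and consider the
following probability distributions, which satisfy the properties of Definition
12.3 (so that $(\sigma,\mu)$ is Bayesian relative to this plausibility order): $\nu_{E_1}(\emptyset) = 1$,
$\nu_{E_2} = \begin{pmatrix} b & c \\ \tfrac{7}{10} & \tfrac{3}{10} \end{pmatrix}$ and
$\nu_{E_3} = \begin{pmatrix} bB & cB & cBf & d \\ \tfrac{7}{21} & \tfrac{3}{21} & \tfrac{3}{21} & \tfrac{8}{21} \end{pmatrix}$;
then a full-support common prior that satisfies the properties of Definition 12.8
is the one shown in Figure 12.17 below:

$\nu = \begin{pmatrix} \emptyset & b & bB & c & cB & cBf & d \\ \tfrac{9}{40} & \tfrac{7}{40} & \tfrac{7}{40} & \tfrac{3}{40} & \tfrac{3}{40} & \tfrac{3}{40} & \tfrac{8}{40} \end{pmatrix}$

Figure 12.17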
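
The claims about the common prior of Figure 12.17 can be verified mechanically. The following is a minimal sketch using exact fractions; it assumes, as the beliefs above suggest, that Player 2's information set is {b, c} and Player 3's is {bB, cBf, d}. The helper `condition` and the variable names are hypothetical, introduced only for this illustration.

```python
from fractions import Fraction as Fr

# Common prior of Figure 12.17 ("" stands for the null history).
prior = {"": Fr(9, 40), "b": Fr(7, 40), "bB": Fr(7, 40), "c": Fr(3, 40),
         "cB": Fr(3, 40), "cBf": Fr(3, 40), "d": Fr(8, 40)}

def condition(dist, support):
    """Condition a distribution on a list of histories."""
    total = sum(dist[h] for h in support)
    return {h: dist[h] / total for h in support}

# Conditioning on the decision histories of the second and third classes.
nu_E2 = condition(prior, ["b", "c"])
nu_E3 = condition(prior, ["bB", "cB", "cBf", "d"])

# UB1(2): f is played with probability 1 at cB, so nu(cBf) = nu(cB) * 1.
ub1_holds = prior["cBf"] == prior["cB"] * 1

# UB2 at the information set {b, c} (assumed) for action B.
ub2_holds = prior["b"] / prior["c"] == prior["bB"] / prior["cB"]

# Beliefs at Player 3's information set {bB, cBf, d} (assumed), via B3.
mu_3 = condition(nu_E3, ["bB", "cBf", "d"])
```

Conditioning the prior on each class reproduces the two distributions given above, and conditioning further on Player 3's information set returns the beliefs 7/18, 3/18 and 8/18.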


This  is  a good  time  to  test  your  understanding  of  the  concepts  introduced  in 
this  section,  by  going  through  the  exercises  in  Section  12.E.5  of  Appendix  12.E  at 
the  end  of  this  chapter. 
Appendix 12.A:

History-based definition of extensive-form game

If A is a set, we denote by $A^*$ the set of finite sequences in A. If
$h = \langle a_1,\dots,a_k \rangle \in A^*$ and $1 \le j \le k$, the sequence $h' = \langle a_1,\dots,a_j \rangle$ is called a prefix
of h (a proper prefix of h if $j < k$). If $h = \langle a_1,\dots,a_k \rangle \in A^*$ and $a \in A$, we denote
the sequence $\langle a_1,\dots,a_k,a \rangle \in A^*$ by ha. A finite extensive form is a tuple
$\langle A, H, N, \iota, \{\approx_i\}_{i \in N} \rangle$ whose elements are:

• A finite  set  of  actions  A. 

• A finite set of histories $H \subseteq A^*$ which is closed under prefixes (that is, if
$h \in H$ and $h' \in A^*$ is a prefix of h, then $h' \in H$). The null (or empty)
history $\langle\,\rangle$, denoted by $\emptyset$, is an element of H and is a prefix of every
history (the null history $\emptyset$ represents the root of the tree). A history $h \in H$
such that, for every $a \in A$, $ha \notin H$, is called a terminal history. The set of
terminal histories is denoted by Z. $D = H \setminus Z$ denotes the set of non-terminal
or decision histories. For every decision history $h \in D$, we denote
by $A(h)$ the set of actions available at h, that is, $A(h) = \{a \in A : ha \in H\}$.

• A finite set $N = \{1,\dots,n\}$ of players. In some cases there is also an additional,
fictitious, player called chance.

• A function $\iota : D \to N \cup \{chance\}$ that assigns a player to each decision
history. Thus $\iota(h)$ is the player who moves at history h. A game is said to
be without chance moves if $\iota(h) \in N$ for every $h \in D$. For every
$i \in N \cup \{chance\}$, let $D_i = \iota^{-1}(i)$ be the set of histories assigned to Player i.
Thus $\{D_{chance}, D_1, \dots, D_n\}$ is a partition of D. If history h is assigned to chance
($\iota(h) = chance$), then a probability distribution over $A(h)$ is given that
assigns positive probability to every $a \in A(h)$.

• For every player $i \in N$, $\approx_i$ is an equivalence relation on $D_i$. The
interpretation of $h \approx_i h'$ is that, when choosing an action at history $h \in D_i$,
Player i does not know whether she is moving at h or at h'. The
equivalence class of $h \in D_i$ is denoted by $I_i(h)$ and is called an information
set of Player i; thus $I_i(h) = \{h' \in D_i : h \approx_i h'\}$. The following restriction
applies: if $h' \in I_i(h)$ then $A(h') = A(h)$, that is, the set of actions available to
a player is the same at any two histories that belong to the same
information set of that player.

• The following property, known as perfect recall, is assumed: for every
Player $i \in N$, if $h_1, h_2 \in D_i$, $a \in A(h_1)$ and $h_1 a$ is a prefix of $h_2$ then, for
every $h' \in I_i(h_2)$, there exists an $h \in I_i(h_1)$ such that ha is a prefix of h'.
Intuitively, perfect recall requires a player to remember what she knew in
the past and what actions she took previously.

• Given an extensive form, one obtains an extensive game by adding, for every
Player $i \in N$, a utility (or payoff) function $U_i : Z \to \mathbb{R}$ (where $\mathbb{R}$ denotes the
set of real numbers; recall that Z is the set of terminal histories).

Figure 12.A.1 below shows an extensive form without chance moves where
$A = \{a,b,s,c,d,e,f,g,h,m,n\}$, $D = \{\emptyset,a,b,ac,ad,acf,ade,adf\}$ (to simplify the
notation we write a instead of $\langle \emptyset, a \rangle$, ac instead of $\langle \emptyset, a, c \rangle$, etc.),
$Z = \{s,ace,acfg,acfh,adeg,adeh,adfm,adfn,bm,bn\}$, $H = D \cup Z$,
$A(\emptyset) = \{a,b,s\}$, $A(a) = \{c,d\}$, $A(ac) = A(ad) = \{e,f\}$,
$A(acf) = A(ade) = \{g,h\}$, $A(adf) = A(b) = \{m,n\}$,
$\iota(\emptyset) = 1$, $\iota(a) = 2$, $\iota(ac) = \iota(ad) = 3$, $\iota(acf) = \iota(ade) = \iota(adf) = \iota(b) = 4$,
$\approx_1 = \{(\emptyset,\emptyset)\}$, $\approx_2 = \{(a,a)\}$, $\approx_3 = \{(ac,ac),(ac,ad),(ad,ac),(ad,ad)\}$ and
$\approx_4 = \{(acf,acf),(acf,ade),(ade,acf),(ade,ade),(adf,adf),(adf,b),(b,adf),(b,b)\}$.
The information sets containing more than one history (for
example, $I_4(b) = \{adf,b\}$) are shown as rounded rectangles. The root of the tree
represents the null history $\emptyset$.
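
The history-based definition lends itself to a direct computational check. The sketch below is a hedged illustration: it rebuilds H, D and A(h) for the extensive form of Figure 12.A.1 from its terminal histories alone. It relies on the fact that every action in this example is a single character, so a history can be written as a string; the function names are hypothetical.

```python
# Terminal histories of Figure 12.A.1; "" is the null history.
Z = {"s", "ace", "acfg", "acfh", "adeg", "adeh", "adfm", "adfn", "bm", "bn"}

def prefix_closure(histories):
    """Smallest prefix-closed set containing the given histories."""
    return {h[:j] for h in histories for j in range(len(h) + 1)}

def available_actions(h, H):
    """A(h) = {a : ha in H}, for single-character actions."""
    return {g[-1] for g in H if len(g) == len(h) + 1 and g.startswith(h)}

H = prefix_closure(Z)

# A history is terminal exactly when no action can be appended to it.
terminal = {h for h in H if not available_actions(h, H)}
D = H - terminal
A = {h: available_actions(h, H) for h in D}
```

Running the sketch recovers the sets D and A(h) listed above, which confirms that H is the prefix closure of Z in this example.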


Figure  12.A.1 
If h and h' are decision histories not assigned to chance, we write $h' \in I(h)$ as
a short-hand for $h' \in I_{\iota(h)}(h)$. Thus $h' \in I(h)$ means that h and h' belong to the
same information set (of the player who moves at h). If h is a history assigned to
chance, we use the convention that $I(h) = \{h\}$.


Given an extensive form, a pure strategy of Player $i \in N$ is a function that
associates with every information set of Player i an action at that information
set, that is, a function $s_i : D_i \to A$ such that (1) $s_i(h) \in A(h)$ and (2) if $h' \in I_i(h)$
then $s_i(h') = s_i(h)$. For example, one of the pure strategies of Player 4 in the
extensive form illustrated in Figure 12.A.1 is $s_4(acf) = s_4(ade) = g$ and
$s_4(adf) = s_4(b) = m$. A behavior strategy of player i is a collection of probability
distributions, one for each information set, over the actions available at that
information set; that is, a function $\sigma_i : D_i \to \Delta(A)$ (where $\Delta(A)$ denotes the
set of probability distributions over A) such that (1) $\sigma_i(h)$ is a probability
distribution over $A(h)$ and (2) if $h' \in I_i(h)$ then $\sigma_i(h') = \sigma_i(h)$. If the game
does not have chance moves, we define a behavior strategy profile as an n-tuple
$\sigma = (\sigma_1,\dots,\sigma_n)$ where, for every $i \in N$, $\sigma_i$ is a behavior strategy of Player i. If
the game has chance moves then we use the convention that a behavior strategy
profile is an (n+1)-tuple $\sigma = (\sigma_1,\dots,\sigma_n,\sigma_{chance})$ where, if h is a history assigned
to chance and $a \in A(h)$, then $\sigma_{chance}(h)(a)$ is the probability associated with a.
When there is no risk of ambiguity (e.g. because no action is assigned to more
than one information set) we shall denote by $\sigma(a)$ the probability assigned to
action a by the relevant component of the strategy profile $\sigma$.17 Note that a
pure strategy is a special case of a behavior strategy where each probability
distribution is degenerate. A behavior strategy is completely mixed at history
$h \in D$ if, for every $a \in A(h)$, $\sigma(a) > 0$. For example, in the extensive form of
Figure 12.A.1 a possible behavior strategy for Player 1 is
$\begin{pmatrix} a & b & s \\ 0 & 0 & 1 \end{pmatrix}$, which
can be more simply denoted by s (and which coincides with a pure strategy of
Player 1) and a possible behavior strategy of Player 2 is
$\begin{pmatrix} c & d \\ \tfrac{1}{3} & \tfrac{2}{3} \end{pmatrix}$ (which is a
completely mixed strategy).

17 If $h \in D_i$ and $\sigma_i$ is the i-th component of $\sigma$, then $\sigma_i(h)$ is a probability distribution over
$A(h)$; thus if $a \in A(h)$ then $\sigma_i(h)(a)$ is the probability assigned to action a by $\sigma_i(h)$: we
denote $\sigma_i(h)(a)$ more succinctly by $\sigma(a)$.
Appendix 12.B:

Proof of Theorem 12.4 using Theorem 12.5

We prove Theorem 12.4, namely that a sequential equilibrium is an
independent perfect Bayesian equilibrium, using Theorem 12.5. To simplify the
notation, we will assume that no action is available at more than one
information set, that is, that if $a \in A(h) \cap A(h')$ then $h' \in I(h)$ (this is without
loss of generality, because we can always rename some of the actions).

Let $(\sigma,\mu)$ be a sequential equilibrium. First we show that Property IND1 is
satisfied. By Theorem 12.5 there is a choice measurable plausibility order $\precsim$
that rationalizes $(\sigma,\mu)$. Let F be an integer-valued representation of $\precsim$ that
satisfies Property CM of Definition 12.6 and let $h' \in I(h)$ and $a \in A(h)$.
Suppose that $h \precsim h'$; then $F(h) \le F(h')$, that is, $F(h) - F(h') \le 0$ and thus, by
CM, $F(ha) - F(h'a) \le 0$, that is, $F(ha) \le F(h'a)$, which implies that $ha \precsim h'a$;
conversely, if $ha \precsim h'a$ then $F(ha) - F(h'a) \le 0$ and, by Property CM,
$F(h) - F(h') \le 0$, which implies that $h \precsim h'$. Thus Property IND1 is satisfied.

Next we show that Property CM is equivalent to the following property:

if $h' \in I(h)$ and $a, b \in A(h)$ then $F(hb) - F(ha) = F(h'b) - F(h'a)$    (♦)

Let $\precsim$ be a plausibility order on the set of histories H and let $F : H \to \mathbb{N}$ be an
integer-valued representation of $\precsim$ that satisfies Property CM of Definition
12.6. Without loss of generality (see Remark 12.2), we can assume that
$F(\emptyset) = 0$. For every decision history h and action $a \in A(h)$, define
$\lambda(a) = F(ha) - F(h)$. The function $\lambda : A \to \mathbb{N}$ is well defined, since, by
assumption, no action is available at more than one information set and, by
CM, if $h' \in I(h)$ then $F(h'a) - F(h') = F(ha) - F(h)$. Then, for every history
$h = \langle a_1, a_2, \dots, a_m \rangle$, $F(h) = \sum_{i=1}^{m} \lambda(a_i)$. In fact,

$\lambda(a_1) + \lambda(a_2) + \dots + \lambda(a_m) = \big(F(a_1) - F(\emptyset)\big) + \big(F(a_1 a_2) - F(a_1)\big) + \dots + \big(F(a_1 a_2 \dots a_m) - F(a_1 a_2 \dots a_{m-1})\big) = F(a_1 a_2 \dots a_m) = F(h)$

(recall that $F(\emptyset) = 0$). Thus, for every $h \in D$ and $a \in A(h)$, $F(ha) = F(h) + \lambda(a)$.
Hence, $F(hb) - F(ha) = F(h) + \lambda(b) - (F(h) + \lambda(a)) = \lambda(b) - \lambda(a)$
and $F(h'b) - F(h'a) = F(h') + \lambda(b) - (F(h') + \lambda(a)) = \lambda(b) - \lambda(a)$ so that
$F(hb) - F(ha) = F(h'b) - F(h'a)$. Thus we have shown that CM implies (♦).
Now we show the converse, namely that (♦) implies CM. Let $\precsim$ be a
plausibility order on the set of histories H and let $F : H \to \mathbb{N}$ be an
integer-valued representation of $\precsim$ that satisfies (♦). Select arbitrary $h' \in I(h)$ and
$a \in A(h)$. Let $b \in A(h)$ be a plausibility-preserving action at h (there must be at
least one such action: see Definition 12.1); then $h \sim hb$ and $h' \sim h'b$. Hence,
since F is a representation of $\precsim$, $F(hb) = F(h)$ and $F(h'b) = F(h')$ so that
$F(h') - F(h) = F(h'b) - F(hb)$. By (♦), $F(h'b) - F(hb) = F(h'a) - F(ha)$. From
the last two equalities it follows that $F(h') - F(h) = F(h'a) - F(ha)$, that is, CM
holds.
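
The additive decomposition used in the first half of this argument can be illustrated on the integer-valued representation F of Figure 12.16. The sketch below is only an illustration under stated assumptions: actions are single characters (so a history is a string), the values of F are those read off Figure 12.16, and the names `increments`, `lam` and `additive` are hypothetical. It checks that $\lambda(a) = F(ha) - F(h)$ does not depend on h and that F(h) is the sum of the increments along h.

```python
# Integer-valued representation F of Figure 12.16 ("" is the null history).
F = {"": 0, "a": 0,
     "b": 1, "c": 1, "bT": 1, "cT": 1,
     "d": 2, "bB": 2, "cB": 2, "cBf": 2, "dL": 2, "bBL": 2, "cBfL": 2,
     "dR": 3, "bBR": 3, "cBfR": 3, "cBe": 3}

def increments(F):
    """Collect the values F(ha) - F(h) observed for each action a."""
    observed = {}
    for history, value in F.items():
        if history:  # skip the null history
            observed.setdefault(history[-1], set()).add(value - F[history[:-1]])
    return observed

observed = increments(F)

# lambda is well defined when each action has a single observed increment.
well_defined = all(len(values) == 1 for values in observed.values())
lam = {a: next(iter(values)) for a, values in observed.items()}

# F(h) equals the sum of the increments of the actions along h.
additive = all(F[h] == sum(lam[a] for a in h) for h in F)
```

In this example every action has a single increment, so the decomposition $F(ha) = F(h) + \lambda(a)$ holds for all histories.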


Since Property CM is equivalent to Property (♦) and the latter implies
Property IND2, we have also proved that a sequential equilibrium satisfies
IND2.


It only remains to show that a sequential equilibrium satisfies Property IND3.
Let $(\sigma,\mu)$ be a sequential equilibrium. Then by Theorem 12.5 there is a choice
measurable plausibility order $\precsim$ that rationalizes $(\sigma,\mu)$, relative to which
$(\sigma,\mu)$ is uniformly Bayesian. Let $a \in A(h)$, $h'a \in I(ha)$ and
$\{h, h', ha, h'a\} \subseteq D^+_\mu$. Since $h' \in I(h)$, $\mu(h) > 0$ and $\mu(h') > 0$, by P2 of
Definition 12.2, h and h' are most plausible histories in $I(h)$ and thus $h \sim h'$.
Similarly ha and h'a are most plausible histories in $I(ha)$ and thus $ha \sim h'a$.
Let E be the equivalence class to which h and h' belong, F the equivalence class
to which ha and h'a belong and let $\nu_E$ and $\nu_F$ be the probability distributions
postulated by Definition 12.3. Let $\nu$ be a uniform common prior (Definition
12.8). Then, by UB2 of Definition 12.8, $\dfrac{\nu(h)}{\nu(h')} = \dfrac{\nu(ha)}{\nu(h'a)}$ and by definition of
common prior

$\nu_E(h) = \dfrac{\nu(h)}{\sum_{x \in E \cap D^+_\mu} \nu(x)}$, $\nu_E(h') = \dfrac{\nu(h')}{\sum_{x \in E \cap D^+_\mu} \nu(x)}$, $\nu_F(ha) = \dfrac{\nu(ha)}{\sum_{y \in F \cap D^+_\mu} \nu(y)}$ and $\nu_F(h'a) = \dfrac{\nu(h'a)}{\sum_{y \in F \cap D^+_\mu} \nu(y)}$.

Thus $\dfrac{\nu_E(h)}{\nu_E(h')} = \dfrac{\nu(h)}{\nu(h')}$ and $\dfrac{\nu_F(ha)}{\nu_F(h'a)} = \dfrac{\nu(ha)}{\nu(h'a)}$ so that
(since $\dfrac{\nu(h)}{\nu(h')} = \dfrac{\nu(ha)}{\nu(h'a)}$) $\dfrac{\nu_E(h)}{\nu_E(h')} = \dfrac{\nu_F(ha)}{\nu_F(h'a)}$. By Property B3 of Definition 12.3,
$\dfrac{\nu_E(h)}{\nu_E(h')} = \dfrac{\mu(h)}{\mu(h')}$ and $\dfrac{\nu_F(ha)}{\nu_F(h'a)} = \dfrac{\mu(ha)}{\mu(h'a)}$;
thus $\dfrac{\mu(h)}{\mu(h')} = \dfrac{\mu(ha)}{\mu(h'a)}$, that is, Property
IND3 is satisfied.
Appendix 12.E: Exercises

12.E.1. Exercises for Section 12.1:
Belief revision and AGM consistency

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 


Exercise  12.1.  Consider  the  following  game: 

1 


2 

0 

1 

0 


Determine  if  there  is  a plausibility  order  that  rationalizes  the  following 
assessment  (Definition  12.2): 


r a b s 

c d 

* / 

g 

f ac  ad 

acf 

ade  adf  b ^ 

I 0 — 

V 3 U 3 

0 1 

1 1 

2 2 

1 

» M = 

v°  1 

1 

5 

i 1 o 

5 5 uy 
Exercise 12.2. Consider again the game of Exercise 12.1. Find all the
assessments that are rationalized by the following plausibility order:

$\begin{pmatrix} \emptyset, s & \text{most plausible} \\ a, ac, ad, ace, ade, adeg, b, bg & \\ acf, adf, acfg, adfg & \\ adeh, bh & \\ acfh, adfh & \text{least plausible} \end{pmatrix}$


12.E.2.  Exercises  for  Section  12.2:  Bayesian  consistency 


The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 


Exercise 12.3. Consider the game of Figure 12.4, reproduced below, and the
assessment

$\sigma = \begin{pmatrix} a & b & c & & d & e & & f & g \\ 0 & 0 & 1 & & 1 & 0 & & \tfrac{1}{3} & \tfrac{2}{3} \end{pmatrix}$, $\mu = \begin{pmatrix} ad & ae & b & & a & bf & bg \\ \tfrac{1}{4} & 0 & \tfrac{3}{4} & & \tfrac{1}{4} & \tfrac{1}{4} & \tfrac{2}{4} \end{pmatrix}$

which is rationalized by the plausibility order

$\begin{pmatrix} \emptyset, c & \text{most plausible} \\ a, ad, adf, adg, b, bf, bg, bfd, bgd & \\ ae, aef, aeg, bfe, bge & \text{least plausible} \end{pmatrix}$

[Figure: the game of Figure 12.4]

Let $F = \{a,ad,adf,adg,b,bf,bg,bfd,bgd\}$ be the middle equivalence class of
the plausibility order. It was shown in Section 12.2 that the probability
distribution $\nu_F = \begin{pmatrix} a & ad & b & bf & bg \\ \tfrac{1}{8} & \tfrac{1}{8} & \tfrac{3}{8} & \tfrac{1}{8} & \tfrac{2}{8} \end{pmatrix}$ satisfies the properties of Definition
12.3. Show that there is no other probability distribution on F that satisfies
those properties.

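Exercise 12.4. Consider the following extensive form and the assessment

$\sigma = \left( f, A, \begin{pmatrix} L & R \\ \tfrac{1}{2} & \tfrac{1}{2} \end{pmatrix}, \begin{pmatrix} \ell & r \\ \tfrac{1}{5} & \tfrac{4}{5} \end{pmatrix}, D \right)$, $\mu = \begin{pmatrix} a & b & c & & d & e \\ 0 & \tfrac{1}{3} & \tfrac{2}{3} & & \tfrac{3}{4} & \tfrac{1}{4} \end{pmatrix}$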

(a) Verify that the given assessment is rationalized by the plausibility order

$\begin{pmatrix} \emptyset, f, fA & \text{most plausible} \\ b, c, d, e, bL, bR, cL, cR, d\ell, dr, e\ell, er, erD & \\ a, aL, aR, fB, erC & \text{least plausible} \end{pmatrix}$

(b) Let E be the top equivalence class of the plausibility order: $E = \{\emptyset, f, fA\}$.
Show that there is a unique probability distribution $\nu_E$ on E that satisfies
the properties of Definition 12.3.

(c) Let F be the middle equivalence class of the plausibility order:
$F = \{b,c,d,e,bL,bR,cL,cR,d\ell,dr,e\ell,er,erD\}$. Show that both of the
following probability distributions satisfy the properties of Definition 12.3:

$\nu_F = \begin{pmatrix} b & c & d & e & er \\ \tfrac{20}{132} & \tfrac{40}{132} & \tfrac{45}{132} & \tfrac{15}{132} & \tfrac{12}{132} \end{pmatrix}$, $\nu'_F = \begin{pmatrix} b & c & d & e & er \\ \tfrac{5}{87} & \tfrac{10}{87} & \tfrac{45}{87} & \tfrac{15}{87} & \tfrac{12}{87} \end{pmatrix}$
12.E.3. Exercises for Section 12.3: Perfect Bayesian equilibrium

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 

Exercise 12.5. Consider the following extensive-form game:

[Figure: extensive-form game of Exercise 12.5]

Find two perfect Bayesian equilibria $(\sigma,\mu)$ and $(\sigma',\mu')$ where both $\sigma$ and $\sigma'$
are pure-strategy profiles and $\sigma \ne \sigma'$.
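Exercise 12.6. Consider the following game:

[Figure: extensive-form game of Exercise 12.6]

Prove that the following assessment is a perfect Bayesian equilibrium:

$\sigma = \left( s, \begin{pmatrix} c & d \\ \tfrac{1}{2} & \tfrac{1}{2} \end{pmatrix}, \begin{pmatrix} e & f \\ \tfrac{1}{3} & \tfrac{2}{3} \end{pmatrix}, \begin{pmatrix} g & h \\ \tfrac{1}{2} & \tfrac{1}{2} \end{pmatrix} \right)$, $\mu = \begin{pmatrix} ac & ad & & acf & ade & adf & b \\ \tfrac{1}{2} & \tfrac{1}{2} & & \tfrac{2}{11} & \tfrac{1}{11} & \tfrac{2}{11} & \tfrac{6}{11} \end{pmatrix}$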

12.E.4. Exercises for Section 12.4: Adding independence

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 

Exercise 12.7. Draw an extensive form where Player 1 moves first and Player 2
moves second without being informed of Player 1's choice. Player 1 chooses
between a and b, while Player 2's choices are c, d and e. Find an assessment
$(\sigma,\mu)$ which is rationalized by a plausibility order that satisfies Property IND1
but fails Property IND2.

Exercise 12.8. Find an extensive form and an assessment $(\sigma,\mu)$ which is
rationalized by a plausibility order that violates Property IND1 but satisfies
Property IND2 and, furthermore, $\mu$ satisfies Property IND3.

12.E.5. Exercises for Section 12.5: Filling the gap between
perfect Bayesian equilibrium and sequential equilibrium

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 

Exercise 12.9. Consider the following (partial) extensive form:

[Figure: partial extensive form of Exercise 12.9]

Using Theorem 12.5, prove that there is no sequential equilibrium $(\sigma,\mu)$
where $\sigma = (a,g,r,\dots)$ (that is, $\sigma$ assigns probability 1 to a, g and r),
$\mu(c) = \mu(e) = \mu(ch) = 0$ and $\mu(h) > 0$ for every other decision history h.
[Hint: consider all the possible plausibility orders that rationalize $(\sigma,\mu)$.]
Exercise 12.10. Consider the game of Figure 12.4, reproduced below:

[Figure: the game referred to in Exercise 12.10]

Let $(\sigma,\mu)$ be an assessment with $\sigma = (a,T,f,L)$ (highlighted by double
edges; note that $\sigma$ is a subgame-perfect equilibrium), $\mu(b) > 0$ and $\mu(c) > 0$.

(a) Prove that $(\sigma,\mu)$ can be rationalized by a choice-measurable plausibility
order only if $\mu$ satisfies the following condition:

$\mu(bB) > 0$ if and only if $\mu(cBf) > 0$

(b) Prove that if, besides being rationalized by a choice-measurable
plausibility order $\precsim$, $(\sigma,\mu)$ is also uniformly Bayesian relative to $\precsim$
(Definition 12.8), then $\mu$ satisfies the following condition:

if $\mu(bB) > 0$ then $\dfrac{\mu(cBf)}{\mu(bB)} = \dfrac{\mu(c)}{\mu(b)}$
Exercise 12.11. (Challenging Question). Consider the following game:

[Figure: extensive-form game of Exercise 12.11]

Let $(\sigma,\mu)$ be the following assessment: $\sigma = (M,\ell,a,c,e)$ (highlighted by
double edges) and
$\mu = \begin{pmatrix} L & M & R & L\ell & Mr & Lm & Rr & Mm & R\ell \\ 0 & 1 & 0 & 0 & 1 & 1 & 0 & 0 & 1 \end{pmatrix}$ (the
decision histories h such that $\mu(h) = 1$ are denoted by large black dots and the
decision histories h such that $\mu(h) = 0$ are denoted by small grey dots).

(a) Prove that $(\sigma,\mu)$ is an independent perfect Bayesian equilibrium
(Definition 12.5).

(b) Using Theorem 12.5 prove that $(\sigma,\mu)$ is not a sequential equilibrium.
[Hint: prove that $(\sigma,\mu)$ cannot be rationalized by a choice-measurable
plausibility order (Definition 12.7).]
Appendix 12.S: Solutions to exercises

Exercise 12.1. From $\sigma$ we get that the plausibility-preserving actions are a, s,
d, e, f and g. Thus ade is as plausible as adf (by transitivity, since each of them is
as plausible as ad) and each of them is more plausible than both acf [since (1) ad
is more plausible than ac, (2) ade and adf are as plausible as ad and (3) acf is as
plausible as ac] and b [since (1) a is more plausible than b and (2) ade and adf are
as plausible as ad, which in turn is as plausible as a]. Hence the most plausible
histories in Player 4's information set are ade and adf. It follows that it must be
that $\mu(ade) > 0$, $\mu(adf) > 0$, $\mu(acf) = 0$ and $\mu(b) = 0$. Hence the given
assessment cannot be rationalized by a plausibility order.

Exercise 12.2. The plausibility-preserving actions are s, c, d, e and g. Thus the
strategy profile must be of the form
$\sigma = \begin{pmatrix} a & b & s & c & d & e & f & g & h \\ 0 & 0 & 1 & p & 1-p & 1 & 0 & 1 & 0 \end{pmatrix}$
with $0 < p < 1$. At Player 3's information set ac and ad are equally plausible
and at Player 4's information set ade and b are equally plausible and each is
more plausible than acf and adf. Thus the system of beliefs must be of the form
$\mu = \begin{pmatrix} ac & ad & acf & ade & adf & b \\ q & 1-q & 0 & r & 0 & 1-r \end{pmatrix}$ with $0 < q < 1$ and $0 < r < 1$.


Exercise 12.3. Since $D^+ \cap F = \{a,ad,b,bf,bg\}$, Property B1 of Definition 12.3
requires that $\nu_F(h) > 0$ if and only if $h \in \{a,ad,b,bf,bg\}$. Let $\nu_F(a) = p \in (0,1)$
and $\nu_F(b) = q \in (0,1)$. Then, by Property B2 of Definition 12.3,
$\nu_F(ad) = \nu_F(a) \times \sigma(d) = p \times 1 = p$, $\nu_F(bf) = \nu_F(b) \times \sigma(f) = q \times \tfrac{1}{3}$ and
$\nu_F(bg) = \nu_F(b) \times \sigma(g) = q \times \tfrac{2}{3}$. Thus
$\nu_F = \begin{pmatrix} a & ad & b & bf & bg \\ p & p & q & \tfrac{1}{3}q & \tfrac{2}{3}q \end{pmatrix}$ and the sum of these
probabilities must be 1: $2p + 2q = 1$, that is, $p + q = \tfrac{1}{2}$. Let $I_2 = \{a,bf,bg\}$ be
the information set of Player 2 and $I_3 = \{b,ad,ae\}$ the information set of Player
3. Then $\nu_F(I_2) = \nu_F(a) + \nu_F(bf) + \nu_F(bg) = p + \tfrac{1}{3}q + \tfrac{2}{3}q = p + q = \tfrac{1}{2}$ and
$\nu_F(I_3) = \nu_F(b) + \nu_F(ad) + \nu_F(ae) = q + p + 0 = \tfrac{1}{2}$. Thus $\dfrac{\nu_F(a)}{\nu_F(I_2)} = \dfrac{p}{1/2} = 2p$ and, by
Property B3, we need this to be equal to $\mu(a) = \tfrac{1}{4}$; solving $2p = \tfrac{1}{4}$ we get
$p = \tfrac{1}{8}$. Similarly, $\dfrac{\nu_F(b)}{\nu_F(I_3)} = \dfrac{q}{1/2} = 2q$ and, by Property B3, we need this to be
equal to $\mu(b) = \tfrac{3}{4}$; solving $2q = \tfrac{3}{4}$ we get $q = \tfrac{3}{8}$ (alternatively, we could have
derived $q = \tfrac{3}{8}$ from $p = \tfrac{1}{8}$ and $p + q = \tfrac{1}{2}$).
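
The solution of Exercise 12.3 can be double-checked with exact arithmetic. The sketch below is a hedged illustration: it takes $p = 1/8$, $q = 3/8$, $\sigma(f) = 1/3$ and $\sigma(g) = 2/3$ as derived above and confirms that conditioning $\nu_F$ on each information set returns the beliefs $\mu$. The helper `conditional` is a hypothetical name.

```python
from fractions import Fraction as Fr

p, q = Fr(1, 8), Fr(3, 8)

# nu_F built from Property B2: ad inherits p, bf and bg split q.
nu_F = {"a": p, "ad": p, "b": q, "bf": q * Fr(1, 3), "bg": q * Fr(2, 3)}

def conditional(nu, info_set):
    """Condition nu on an information set; missing histories have mass 0."""
    total = sum(nu.get(h, Fr(0)) for h in info_set)
    return {h: nu.get(h, Fr(0)) / total for h in info_set}

mu_player2 = conditional(nu_F, ["a", "bf", "bg"])
mu_player3 = conditional(nu_F, ["b", "ad", "ae"])
```

Both conditionals agree with the beliefs of the assessment, as Property B3 requires.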


Exercise 12.4. (a) The plausibility-preserving actions are f, A, L, R, $\ell$, r and D
and these are precisely the actions that are assigned positive probability by $\sigma$.
Furthermore, the most plausible histories in information set {a,b,c} are b and c
and the two histories in information set {d,e} are equally plausible. This is
consistent with the fact that $D^+ = \{\emptyset,f,b,c,d,e,er\}$. Thus the properties of
Definition 12.2 are satisfied.

(b) By Property B1 of Definition 12.3 the support of $\nu_E$ must be
$D^+ \cap E = \{\emptyset, f\}$ and by Property B2 it must be that
$\nu_E(f) = \nu_E(\emptyset) \times \sigma(f) = \nu_E(\emptyset) \times 1 = \nu_E(\emptyset)$. Thus the only solution is
$\nu_E(\emptyset) = \nu_E(f) = \tfrac{1}{2}$.

(c) Since $D^+ \cap F = \{b,c,d,e,er\}$, by Property B1 of Definition 12.3 the support
of the probability distribution must coincide with the set {b,c,d,e,er}, which
is indeed true for both $\nu_F$ and $\nu'_F$. By Property B2, the probability of er must
be equal to the probability of e times $\sigma(r) = \tfrac{4}{5}$ and this is indeed true for both
$\nu_F$ and $\nu'_F$. By Property B3, the conditional probability
$\dfrac{\nu_F(b)}{\nu_F(a) + \nu_F(b) + \nu_F(c)} = \dfrac{20/132}{0 + 20/132 + 40/132} = \dfrac{1}{3}$ must be equal to $\mu(b)$ and this is indeed
true; similarly, the conditional probability
$\dfrac{\nu_F(d)}{\nu_F(d) + \nu_F(e)} = \dfrac{45/132}{45/132 + 15/132} = \dfrac{3}{4}$ must
be equal to $\mu(d)$ and this is also true. Similar computations show that
$\dfrac{\nu'_F(b)}{\nu'_F(a) + \nu'_F(b) + \nu'_F(c)} = \mu(b)$ and $\dfrac{\nu'_F(d)}{\nu'_F(d) + \nu'_F(e)} = \mu(d)$.
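
The computations of part (c) can be repeated for both distributions at once. The sketch below is an illustration under stated assumptions: the numerators and denominators are those of $\nu_F$ and $\nu'_F$ in the exercise, $\sigma(r) = 4/5$ is the value the two distributions imply, and the function names `dist` and `check` are hypothetical.

```python
from fractions import Fraction as Fr

def dist(numerators, denominator):
    """Build a distribution over the histories b, c, d, e, er."""
    names = ["b", "c", "d", "e", "er"]
    return {h: Fr(n, denominator) for h, n in zip(names, numerators)}

nu_F = dist([20, 40, 45, 15, 12], 132)
nu_F_alt = dist([5, 10, 45, 15, 12], 87)

def check(nu):
    """Return (sums to one, B2 for er, implied mu(b), implied mu(d))."""
    sums_to_one = sum(nu.values()) == 1
    b2_holds = nu["er"] == nu["e"] * Fr(4, 5)   # assumes sigma(r) = 4/5
    mu_b = nu["b"] / (nu["b"] + nu["c"])        # nu(a) = 0
    mu_d = nu["d"] / (nu["d"] + nu["e"])
    return sums_to_one, b2_holds, mu_b, mu_d
```

Both distributions pass the same checks, which illustrates that the properties of Definition 12.3 need not pin down a unique distribution on an equivalence class.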
Exercise 12.5. One perfect Bayesian equilibrium is

$\sigma = (b,c,e,g)$, $\mu = \begin{pmatrix} ac & ad & & acf & ade & adf \\ 1 & 0 & & \tfrac{1}{2} & \tfrac{1}{2} & 0 \end{pmatrix}$.

Let us first verify sequential rationality. For Player 1, at the root, b gives a
payoff of 2, while a gives a payoff of 1; thus b is sequentially rational. For
Player 2, c gives a payoff of 1 and d a payoff of 0; thus c is sequentially rational.
For Player 3, e gives a payoff of 3, while f gives a payoff of 2; thus e is
sequentially rational. For Player 1 at his bottom information set, g gives a
payoff of $(\tfrac{1}{2})3 + (\tfrac{1}{2})3 = 3$ and h gives a payoff of $(\tfrac{1}{2})1 + (\tfrac{1}{2})3 = 2$; thus g is
sequentially rational.

The following plausibility order rationalizes the above assessment:

$\begin{pmatrix} \emptyset, b & \text{most plausible} \\ a, ac, ace & \\ acf, acfg, ad, ade, adeg & \\ adf, adfg, acfh, adeh & \\ adfh & \text{least plausible} \end{pmatrix}$

Name the equivalence classes of this order $E_1, E_2, \dots, E_5$ (with $E_1$ being the top
one and $E_5$ the bottom one). Then the following probability distributions on
the sets $E_i \cap D^+$ ($i = 1,2,3$: note $E_i \cap D^+ \ne \emptyset$ if and only if $i \in \{1,2,3\}$, since
$D^+ = \{\emptyset,a,ac,acf,ade\}$) satisfy the properties of Definition 12.3: $\nu_{E_1}(\emptyset) = 1$,


r a 

ac ^ 

f acf  ad  ade ' 

d2  = 

1 1 

’ ^3  = 

ill 

V2 

2 J 

V 3 3 3 y 

Another perfect Bayesian equilibrium is:

$\sigma = (a,d,e,h)$, $\mu = \begin{pmatrix} ac & ad & & acf & ade & adf \\ 0 & 1 & & 0 & 1 & 0 \end{pmatrix}$.

Sequential rationality is easily verified. The assessment is rationalized by:

$\begin{pmatrix} \emptyset, a, ad, ade, adeh & \text{most plausible} \\ b, ac, ace, adf, adfh & \\ acf, acfh, adeg & \\ acfg, adfg & \text{least plausible} \end{pmatrix}$

Only the top equivalence class $E_1 = \{\emptyset, a, ad, ade, adeh\}$ has a non-empty
intersection with $D^+ = \{\emptyset, a, ad, ade\}$. The following probability distribution
satisfies the properties of Definition 12.3:
$\nu_{E_1} = \begin{pmatrix} \emptyset & a & ad & ade \\ \tfrac{1}{4} & \tfrac{1}{4} & \tfrac{1}{4} & \tfrac{1}{4} \end{pmatrix}$.

Exercise  12.6.  The  game  under  consideration  is: 
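We need to show that the following is a perfect Bayesian equilibrium:

$\sigma = \left( s, \begin{pmatrix} c & d \\ \tfrac{1}{2} & \tfrac{1}{2} \end{pmatrix}, \begin{pmatrix} e & f \\ \tfrac{1}{3} & \tfrac{2}{3} \end{pmatrix}, \begin{pmatrix} g & h \\ \tfrac{1}{2} & \tfrac{1}{2} \end{pmatrix} \right)$, $\mu = \begin{pmatrix} ac & ad & & acf & ade & adf & b \\ \tfrac{1}{2} & \tfrac{1}{2} & & \tfrac{2}{11} & \tfrac{1}{11} & \tfrac{2}{11} & \tfrac{6}{11} \end{pmatrix}$

First we verify sequential rationality. For Player 1 the possible payoffs are:

from s: 3

from b: $\tfrac{1}{2}(3) + \tfrac{1}{2}(2) = 2.5$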

from  a:  i[i(l)  + f(K2)  + t(l»]  +i [t(3)  + !(H>  + i0»] 

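Thus s is sequentially rational. For Player 2 the possible payoffs are:

from c: $\tfrac{1}{3}(2) + \tfrac{2}{3}(2) = 2$

from d: $\tfrac{1}{3}\left[\tfrac{1}{2}(0) + \tfrac{1}{2}(6)\right] + \tfrac{2}{3}\left[\tfrac{1}{2}(2) + \tfrac{1}{2}(1)\right] = 2$

Thus both c and d are sequentially rational and so is any mixture of the two,
in particular the mixture $\begin{pmatrix} c & d \\ \tfrac{1}{2} & \tfrac{1}{2} \end{pmatrix}$.

For Player 3 the possible payoffs are:

from e: $\tfrac{1}{2}(2) + \tfrac{1}{2}\left[\tfrac{1}{2}(0) + \tfrac{1}{2}(4)\right] = 2$

from f: $\tfrac{1}{2}\left[\tfrac{1}{2}(3) + \tfrac{1}{2}(1)\right] + \tfrac{1}{2}\left[\tfrac{1}{2}(1) + \tfrac{1}{2}(3)\right] = 2$

Thus both e and f are sequentially rational and so is any mixture of the two,
in particular the mixture $\begin{pmatrix} e & f \\ \tfrac{1}{3} & \tfrac{2}{3} \end{pmatrix}$.

For Player 4 the possible payoffs are as follows: from g:
$\tfrac{2}{11}(2) + \tfrac{1}{11}(0) + \tfrac{2}{11}(2) + \tfrac{6}{11}(3) = \tfrac{26}{11}$ and from h:
$\tfrac{2}{11}(3) + \tfrac{1}{11}(2) + \tfrac{2}{11}(3) + \tfrac{6}{11}(2) = \tfrac{26}{11}$.
Thus both g and h are rational and so is any mixture of the two, in particular
the mixture $\begin{pmatrix} g & h \\ \tfrac{1}{2} & \tfrac{1}{2} \end{pmatrix}$.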


For AGM consistency, note that all of the actions, except a and b, are plausibility preserving and, furthermore, all decision histories are assigned positive probability by $\mu$. Thus there is only one plausibility order that rationalizes the given assessment, namely the one that has only two equivalence classes: the top one being $\{\emptyset, s\}$ and the other one consisting of all the remaining histories. For Bayesian consistency, the probability distribution for the top equivalence class is the trivial one that assigns probability 1 to the null history $\emptyset$. Let $E = H \setminus \{\emptyset, s\}$ be the other equivalence class and note that $E \cap D_\mu^+ = \{a, ac, ad, acf, ade, adf, b\}$. In order for a probability distribution $\nu_E$ to satisfy the properties of Definition 12.3, the support of $\nu_E$ must be the set $\{a, ac, ad, acf, ade, adf, b\}$ (Property B1). Let $\nu_E(a) = p$; then by Property B2 it must be that $\nu_E(ac) = \nu_E(ad) = \tfrac{p}{2}$ (since $\sigma(c) = \sigma(d) = \tfrac{1}{2}$), $\nu_E(acf) = \nu_E(adf) = \tfrac{p}{3}$ (since $\sigma(c) \times \sigma(f) = \sigma(d) \times \sigma(f) = \tfrac{1}{2} \times \tfrac{2}{3} = \tfrac{1}{3}$) and $\nu_E(ade) = \tfrac{p}{6}$ (since $\sigma(d) \times \sigma(e) = \tfrac{1}{2} \times \tfrac{1}{3} = \tfrac{1}{6}$). Thus $\nu_E$ must be of the form

$$\begin{pmatrix} a & ac & ad & acf & ade & adf & b \\ p & \tfrac{p}{2} & \tfrac{p}{2} & \tfrac{p}{3} & \tfrac{p}{6} & \tfrac{p}{3} & q \end{pmatrix}$$

with the sum equal to 1, that is, $\tfrac{17}{6}p + q = 1$. Furthermore, by Property B3, it must be that

$$\mu(b) = \frac{\nu_E(b)}{\nu_E(acf) + \nu_E(ade) + \nu_E(adf) + \nu_E(b)}, \quad\text{that is,}\quad \frac{6}{11} = \frac{q}{\tfrac{p}{3} + \tfrac{p}{6} + \tfrac{p}{3} + q}.$$

The solution to the pair of equations $\tfrac{17}{6}p + q = 1$ and $\tfrac{6}{11} = \dfrac{q}{\tfrac{p}{3} + \tfrac{p}{6} + \tfrac{p}{3} + q}$ is $p = q = \tfrac{6}{23}$, yielding

$$\nu_E = \begin{pmatrix} a & ac & ad & acf & ade & adf & b \\ \tfrac{6}{23} & \tfrac{3}{23} & \tfrac{3}{23} & \tfrac{2}{23} & \tfrac{1}{23} & \tfrac{2}{23} & \tfrac{6}{23} \end{pmatrix}.$$

Thus we have shown that the given assessment is AGM- and Bayes-consistent and sequentially rational, hence a perfect Bayesian equilibrium.
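The two conditions on p and q (total mass 1 and Property B3 at Player 4's information set) can be solved mechanically. The sketch below is only an illustration under the values used in this solution ($\mu(b) = 6/11$, $\sigma(c) = \sigma(d) = 1/2$, $\sigma(e) = 1/3$, $\sigma(f) = 2/3$); the variable names are not from the text.

```python
from fractions import Fraction as F

# nu_E(h) as a multiple of p, for h = a, ac, ad, acf, ade, adf (Property B2).
weights = [F(1), F(1, 2), F(1, 2), F(1, 3), F(1, 6), F(1, 3)]
total_weight = sum(weights)        # coefficient of p in "sum equals 1"
info_set_weight = sum(weights[3:]) # acf, ade, adf share an information set with b

mu_b = F(6, 11)
# Property B3: mu_b = q / (info_set_weight * p + q), solved for the ratio q / p.
q_over_p = mu_b * info_set_weight / (1 - mu_b)

p = 1 / (total_weight + q_over_p)
q = q_over_p * p
nu_E = [w * p for w in weights] + [q]

print(p, q)
print([str(x) for x in nu_E])
```

The computed distribution sums to 1 and reproduces $\mu(b) = 6/11$ when conditioned on the information set $\{acf, ade, adf, b\}$.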

Exercise  12.7.  The  extensive  form  is  as  follows: 

[Figure: extensive form of the game for Exercise 12.7]

Let $\sigma = \begin{pmatrix} a & b & c & d & e \\ 1 & 0 & 1 & 0 & 0 \end{pmatrix}$ and $\mu = \begin{pmatrix} a & b \\ 1 & 0 \end{pmatrix}$. This assessment is rationalized by the following plausibility order:

$$\begin{pmatrix} \emptyset, a, ac & \text{most plausible} \\ b, bc & \\ ad & \\ ae & \\ be & \\ bd & \text{least plausible} \end{pmatrix}$$

which satisfies Property IND1 (since $a \prec b$ and $ac \prec bc$, $ad \prec bd$, $ae \prec be$) but fails Property IND2 since $b \in I(a)$ and $ad \prec ae$ (implying that, conditional on a, d is more plausible than e) but $be \prec bd$ (implying that, conditional on b, e is more plausible than d).
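The two independence checks can be automated for this small example. The sketch below assumes the readings of IND1 (within an information set, $h \precsim h'$ if and only if $ha \precsim h'a$) and IND2 ($ha \precsim hb$ if and only if $h'a \precsim h'b$) that the argument above relies on; the rank dictionary encodes the plausibility order of this solution, with a lower rank meaning more plausible, and the empty string standing for the null history.

```python
# Plausibility ranks: 0 is most plausible.
rank = {"": 0, "a": 0, "ac": 0, "b": 1, "bc": 1,
        "ad": 2, "ae": 3, "be": 4, "bd": 5}
info_set = ["a", "b"]      # Player 2's information set
actions = ["c", "d", "e"]  # actions available there

def satisfies_ind1(rank, info_set, actions):
    # h is at least as plausible as k  <=>  hx is at least as plausible as kx
    return all(
        (rank[h] <= rank[k]) == (rank[h + x] <= rank[k + x])
        for h in info_set for k in info_set for x in actions
    )

def satisfies_ind2(rank, info_set, actions):
    # hx is at least as plausible as hy  <=>  kx is at least as plausible as ky
    return all(
        (rank[h + x] <= rank[h + y]) == (rank[k + x] <= rank[k + y])
        for h in info_set for k in info_set
        for x in actions for y in actions
    )

print(satisfies_ind1(rank, info_set, actions))  # True
print(satisfies_ind2(rank, info_set, actions))  # False
```

IND2 fails exactly at the pair (d, e): d is ranked above e after a, but below e after b.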

Exercise  12.8.  The  game  of  Figure  12.7,  reproduced  below,  provides  such  an 
example: 

[Figure: the game of Figure 12.7]

Consider the assessment consisting of the strategy profile $\sigma = (c, d, g)$ (highlighted by double edges), together with the system of beliefs $\mu = \left( \begin{pmatrix} a & b \\ 1 & 0 \end{pmatrix}, \begin{pmatrix} ae & be \\ 0 & 1 \end{pmatrix} \right)$. This assessment is rationalized by the plausibility order

$$\begin{pmatrix} \emptyset, c & \text{most plausible} \\ a, ad & \\ b, bd & \\ be, beg & \\ ae, aeg & \\ bef & \\ aef & \text{least plausible} \end{pmatrix}$$

which violates Property IND1, since $a \prec b$ and $be \prec ae$. On the other hand, the above plausibility order satisfies Property IND2 since (1) $ad \prec ae$ and $bd \prec be$ and (2) $aeg \prec aef$ and $beg \prec bef$. Furthermore, $\mu$ satisfies Property IND3 trivially, since $a, be \in D_\mu^+$ but $ae, b \notin D_\mu^+$.


Exercise  12.9.  The  game  under  consideration  is  the  following: 


Consider an arbitrary assessment of the form $(\sigma,\mu)$ where $\sigma = (a, g, r, \dots)$, $\mu(c) = \mu(e) = \mu(ch) = 0$ and $\mu(h) > 0$ for every other decision history h. There are plausibility orders that rationalize $(\sigma,\mu)$, for example the following:

$$\begin{pmatrix} \emptyset, a & \text{most plausible} \\ b, bg, d, dr & \\ bh, ds & \\ c, cg, e, er & \\ es & \\ ch & \text{least plausible} \end{pmatrix} \quad\text{or}\quad \begin{pmatrix} \emptyset, a & \text{most plausible} \\ b, bg, d, dr & \\ c, cg, e, er & \\ bh, ds & \\ es & \\ ch & \text{least plausible} \end{pmatrix}$$

First we show that any plausibility order $\precsim$ that rationalizes $(\sigma,\mu)$ must satisfy the following properties:

• $c \sim e$ [because, by P2 of Definition 12.2, $cg \sim er$ (since $cg \in I(er)$, $\mu(cg) > 0$ and $\mu(er) > 0$) and, by P1 of Definition 12.2, $c \sim cg$ and $e \sim er$ (since both g and r are plausibility preserving) and thus, by transitivity, $c \sim e$],

• $es \prec ch$ [this follows from P2 of Definition 12.2, since es and ch belong to the same information set and $\mu(es) > 0$, while $\mu(ch) = 0$],

• $b \sim d$ [because, by P2 of Definition 12.2, $bg \sim dr$ (since $bg \in I(dr)$, $\mu(bg) > 0$ and $\mu(dr) > 0$) and, by P1 of Definition 12.2, $b \sim bg$ and $d \sim dr$ (since both g and r are plausibility preserving) and thus, by transitivity, $b \sim d$],

• $bh \sim ds$ [by P2 of Definition 12.2, because $bh \in I(ds)$, $\mu(bh) > 0$ and $\mu(ds) > 0$].

Next we show that no plausibility order that rationalizes $(\sigma,\mu)$ is choice measurable. Select an arbitrary plausibility order that rationalizes $(\sigma,\mu)$ and let F be an integer-valued representation of it. Then the following must be true:

(1) $F(e) - F(es) > F(c) - F(ch)$ (because $c \sim e$, implying that $F(c) = F(e)$, and $es \prec ch$, implying that $F(es) < F(ch)$),

(2) $F(b) - F(bh) = F(d) - F(ds)$ (because $bh \sim ds$, implying that $F(bh) = F(ds)$, and $b \sim d$, implying that $F(b) = F(d)$).

Thus if, as required by choice measurability, $F(c) - F(ch) = F(b) - F(bh)$ then, by (1) and (2), $F(e) - F(es) > F(d) - F(ds)$, which violates choice measurability. It follows from Theorem 12.5 that, since no plausibility order that rationalizes $(\sigma,\mu)$ is choice measurable, $(\sigma,\mu)$ cannot be a sequential equilibrium.
Exercise 12.10. The game under consideration is the following:

[Figure: extensive form of the game for Exercise 12.10]

Let $(\sigma,\mu)$ be an assessment with $\sigma = (a, T, f, L)$, $\mu(b) > 0$ and $\mu(c) > 0$.

(a) We have to prove that $(\sigma,\mu)$ can be rationalized by a choice-measurable plausibility order only if $\mu$ satisfies the following condition:

$\mu(bB) > 0$ if and only if $\mu(cBf) > 0$.

Let $\precsim$ be a choice-measurable plausibility order that rationalizes $(\sigma,\mu)$ and let F be an integer-valued representation of $\precsim$ that satisfies choice measurability. Since $\mu(b) > 0$ and $\mu(c) > 0$, by P2 of Definition 12.2, $b \sim c$ and thus $F(b) = F(c)$; by choice measurability, $F(b) - F(c) = F(bB) - F(cB)$ and thus $F(bB) = F(cB)$, so that $bB \sim cB$. Since $\sigma(f) > 0$, by P1 of Definition 12.2, $cB \sim cBf$ and therefore, by transitivity of $\precsim$, $bB \sim cBf$. Hence if $\mu(bB) > 0$ then, by P2 of Definition 12.2, $bB \in \mathrm{Min}_{\precsim}\{bB, cBf, d\}$ (for any set $S \subseteq H$, $\mathrm{Min}_{\precsim} S = \{h \in S : h \precsim h' \text{ for all } h' \in S\}$) and thus $cBf \in \mathrm{Min}_{\precsim}\{bB, cBf, d\}$ so that, by P2 of Definition 12.2, $\mu(cBf) > 0$. The proof that if $\mu(cBf) > 0$ then $\mu(bB) > 0$ is analogous.
(b) We have to prove that if, besides being rationalized by a choice-measurable plausibility order $\precsim$, $(\sigma,\mu)$ is also uniformly Bayesian relative to $\precsim$ (Definition 12.8), then $\mu$ satisfies the following condition:

$$\text{if } \mu(bB) > 0 \text{ then } \frac{\mu(cBf)}{\mu(bB)} = \frac{\mu(c)}{\mu(b)}.$$

Suppose that $\mu(b) > 0$, $\mu(c) > 0$ (so that $b \sim c$) and $\mu(bB) > 0$. Let $\nu$ be a full-support common prior that satisfies the properties of Definition 12.8. Then, by UB2, $\dfrac{\nu(c)}{\nu(b)} = \dfrac{\nu(cB)}{\nu(bB)}$ and, by UB1, since $\sigma(f) = 1$, $\nu(cBf) = \nu(cB) \times \sigma(f) = \nu(cB)$. Let E be the equivalence class that contains b. Then $E \cap D_\mu^+ = \{b, c\}$. Since $\nu_E(\cdot) = \nu(\cdot \mid E \cap D_\mu^+)$, by B3 of Definition 12.3,

$$\mu(b) = \frac{\nu(b)}{\nu(b) + \nu(c)} \quad\text{and}\quad \mu(c) = \frac{\nu(c)}{\nu(b) + \nu(c)}, \quad\text{so that}\quad \frac{\mu(c)}{\mu(b)} = \frac{\nu(c)}{\nu(b)}.$$

Let G be the equivalence class that contains bB. Then, since by hypothesis $\mu(bB) > 0$, it follows from the condition proved in part (a) that either $G \cap D_\mu^+ = \{bB, cBf\}$ or $G \cap D_\mu^+ = \{bB, cBf, d\}$. Since $\nu_G(\cdot) = \nu(\cdot \mid G \cap D_\mu^+)$, by B3 of Definition 12.3, in the former case

$$\mu(bB) = \frac{\nu(bB)}{\nu(bB) + \nu(cBf)} \quad\text{and}\quad \mu(cBf) = \frac{\nu(cBf)}{\nu(bB) + \nu(cBf)}$$

and in the latter case

$$\mu(bB) = \frac{\nu(bB)}{\nu(bB) + \nu(cBf) + \nu(d)} \quad\text{and}\quad \mu(cBf) = \frac{\nu(cBf)}{\nu(bB) + \nu(cBf) + \nu(d)};$$

thus in both cases $\dfrac{\mu(cBf)}{\mu(bB)} = \dfrac{\nu(cBf)}{\nu(bB)}$. Hence, since $\nu(cBf) = \nu(cB)$, $\dfrac{\mu(cBf)}{\mu(bB)} = \dfrac{\nu(cB)}{\nu(bB)}$ and, therefore, since (as shown above) $\dfrac{\nu(cB)}{\nu(bB)} = \dfrac{\nu(c)}{\nu(b)}$ and $\dfrac{\nu(c)}{\nu(b)} = \dfrac{\mu(c)}{\mu(b)}$, we have that

$$\frac{\mu(cBf)}{\mu(bB)} = \frac{\mu(c)}{\mu(b)}.$$
Exercise 12.11 (Challenging Question). The game under consideration is:


(a) We have to show that $(\sigma,\mu)$ is an independent perfect Bayesian equilibrium. Sequential rationality is straightforward to verify. It is also straightforward to verify that the following plausibility order rationalizes $(\sigma,\mu)$:

$$\begin{pmatrix} \emptyset, M, M\ell & \text{most plausible} \\ R, R\ell, R\ell e & \\ Mm, Mme & \\ Mr, Mra & \\ L, L\ell, L\ell a & \\ Rm & \\ Lm, Lmc & \\ Rr, Rrc & \\ Lr & \\ R\ell f & \\ Mmf & \\ Lmd & \\ Rrd & \\ Mrb & \\ L\ell b & \text{least plausible} \end{pmatrix}$$

Let us check that the above plausibility order satisfies properties IND1 and IND2. For IND1 first note that there are no two decision histories h and h' that belong to the same information set and are such that $h \sim h'$; thus we only need to check that if h and h' belong to the same information set and $h \prec h'$ then $ha \prec h'a$ for every $a \in A(h)$. This is indeed true:

(1) $M \prec R$ and $M\ell \prec R\ell$, $Mm \prec Rm$ and $Mr \prec Rr$,

(2) $M \prec L$ and $M\ell \prec L\ell$, $Mm \prec Lm$ and $Mr \prec Lr$,

(3) $Mr \prec L\ell$ and $Mra \prec L\ell a$ and $Mrb \prec L\ell b$,

(4) $Lm \prec Rr$ and $Lmc \prec Rrc$ and $Lmd \prec Rrd$,

(5) $R\ell \prec Mm$ and $R\ell e \prec Mme$ and $R\ell f \prec Mmf$.

For IND2 first note that there is no decision history h such that $ha \sim hb$ for $a, b \in A(h)$ with $a \neq b$; thus we only need to show that if $ha \prec hb$ and $h' \in I(h)$ then $h'a \prec h'b$. This is indeed true:

(1) $M\ell \prec Mm$ and $L\ell \prec Lm$ and $R\ell \prec Rm$,

(2) $Mm \prec Mr$ and $Lm \prec Lr$ and $Rm \prec Rr$,

and the rest is trivial, since at the other information sets there are only two actions, one of which is plausibility preserving and the other is not.

Finally, $\mu$ satisfies IND3 trivially since at each information set only one history is assigned positive probability by $\mu$. Thus we have shown that $(\sigma,\mu)$ is an independent perfect Bayesian equilibrium.
(b) To prove that $(\sigma,\mu)$ is not a sequential equilibrium it would not be sufficient to prove that the plausibility order given above is not choice measurable (although it is indeed true that it is not choice measurable), because in principle there could be another plausibility order which is choice measurable and rationalizes $(\sigma,\mu)$. Thus we need to show that any plausibility order that rationalizes $(\sigma,\mu)$ is not choice measurable. Let $\precsim$ be a plausibility order that rationalizes $(\sigma,\mu)$; then it must satisfy the following properties:

• $Lm \prec Rr$ (because they belong to the same information set and $\mu(Lm) > 0$ while $\mu(Rr) = 0$). Thus if F is any integer-valued representation of $\precsim$ it must be that

$F(Lm) < F(Rr)$. (1)

• $Mr \prec L\ell \sim L$ (because Mr and $L\ell$ belong to the same information set and $\mu(Mr) > 0$ while $\mu(L\ell) = 0$; furthermore, $\ell$ is a plausibility-preserving action since $\sigma(\ell) > 0$). Thus if F is any integer-valued representation of $\precsim$ it must be that

$F(Mr) < F(L)$. (2)

• $R \sim R\ell \prec Mm$ (because $\ell$ is a plausibility-preserving action, $R\ell$ and Mm belong to the same information set and $\mu(R\ell) > 0$ while $\mu(Mm) = 0$). Thus if F is any integer-valued representation of $\precsim$ it must be that

$F(R) < F(Mm)$. (3)

Now suppose that $\precsim$ is choice measurable and let F be an integer-valued representation of it that satisfies choice measurability (Property CM of Definition 12.7). From (1) and (2) we get that

$F(Lm) - F(L) < F(Rr) - F(Mr)$ (4)

and by choice measurability

$F(Rr) - F(Mr) = F(R) - F(M)$. (5)

It follows from (4) and (5) that

$F(Lm) - F(L) < F(R) - F(M)$. (6)

Subtracting $F(M)$ from both sides of (3) we obtain

$F(R) - F(M) < F(Mm) - F(M)$. (7)

It follows from (6) and (7) that $F(Lm) - F(L) < F(Mm) - F(M)$, which can be written as $F(M) - F(L) < F(Mm) - F(Lm)$, yielding a contradiction, because choice measurability requires that $F(M) - F(L) = F(Mm) - F(Lm)$.
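As an informal sanity check on the contradiction derived in part (b), one can search a bounded grid of integer values for an assignment F that satisfies inequalities (1) to (3) together with the two choice-measurability equalities used in the argument. This is only an illustrative sketch: the bound of 6 values per history is an arbitrary assumption, and an empty search result is consistent with, but of course does not replace, the proof.

```python
from itertools import product

histories = ["M", "L", "R", "Lm", "Mm", "Mr", "Rr"]

def satisfies_all(F):
    return (
        F["Lm"] < F["Rr"]                             # (1)
        and F["Mr"] < F["L"]                          # (2)
        and F["R"] < F["Mm"]                          # (3)
        and F["Rr"] - F["Mr"] == F["R"] - F["M"]      # choice measurability, (5)
        and F["M"] - F["L"] == F["Mm"] - F["Lm"]      # choice measurability, final step
    )

solutions = [
    values
    for values in product(range(6), repeat=len(histories))
    if satisfies_all(dict(zip(histories, values)))
]

print(len(solutions))
```

No assignment in the searched range satisfies all five conditions, in line with the algebraic argument.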
PART V

Advanced Topics III: Incomplete Information

Incomplete Information: Static Games

13.1 Interactive situations with incomplete information

An  implicit  assumption  in  game  theory  is  that  the  game  being  played  is 
common  knowledge  among  the  players.  The  expression  “incomplete 
information”  refers  to  situations  where  some  of  the  elements  of  the 
game  (e.g.  the  actions  available  to  the  players,  the  possible  outcomes,  the 
players’  preferences,  etc.)  are  not  common  knowledge.  In  such  situations  the 
knowledge  and  beliefs  of  the  players  about  the  game  need  to  be  made  an 
integral  part  of  the  description  of  the  situation.  Pioneering  work  in  this 
direction  was  done  by  John  Harsanyi  (1967,  1968),  who  (together  with  John 
Nash  and  Reinhard  Selten)  was  the  recipient  of  the  1994  Nobel  prize  in 
economics.  Harsanyi  suggested  a method  for  converting  a situation  of 
incomplete  information  into  an  extensive  game  with  imperfect  information 
(this  is  the  so-called  Harsanyi  transformation).  The  theory  of  games  of 
incomplete  information  has  been  developed  for  the  case  of  von  Neumann- 
Morgenstern  payoffs  and  the  solution  concept  proposed  by  Harsanyi  is 
Bayesian  Nash  equilibrium,  which  is  merely  Nash  equilibrium  of  the  imperfect- 
information  game  that  is  obtained  by  applying  the  Harsanyi  transformation. 

Although  the  approach  put  forward  by  Harsanyi  was  in  terms  of  “types”  of 
players  and  of  probability  distributions  over  types,  we  shall  develop  the  theory 
using  the  so-called  “state-space”  approach,  which  makes  use  of  the  interactive 
knowledge-belief  structures  developed  in  Chapter  8.  In  Chapter  15  we  will 
explain  the  “type-space”  approach  and  show  how  to  convert  one  type  of 
structure  into  the  other. 




GAME  THEORY  - Giacomo  Bonanno 


The  distinction  between  complete  and  incomplete  information  is  not  at  all  the 
same  as  that  between  perfect  and  imperfect  information.  To  say  that  a game  (in 
extensive  form)  has  imperfect  information  is  to  say  that  there  is  at  least  one 
player  who  may  have  to  make  a choice  in  a situation  where  she  does  not  know 
what  choices  were  made  previously  by  other  players.  To  say  that  there  is 
incomplete  information  is  to  say  that  there  is  at  least  one  player  who  does  not 
quite  know  what  game  she  is  playing.  Consider,  for  example,  the  players’ 
preferences.  In  some  situations  it  is  not  overly  restrictive  to  assume  that  the 
players’  preferences  are  common  knowledge:  for  instance,  in  the  game  of  chess, 
it  seems  quite  plausible  to  assume  that  it  is  common  knowledge  that  both 
players  prefer  winning  to  either  drawing  or  losing  (and  prefer  drawing  to 
losing).  But  think  of  a contractual  dispute.  Somebody  claims  that  you  owe  him 
a sum  of  money  and  threatens  to  sue  you  if  you  don’t  pay.  We  can  view  this 
situation  as  a two-player  game:  you  have  two  strategies,  “pay”  and  “not  pay”, 
and  he  has  two  strategies,  “sue  (if  no  payment)”  and  “not  sue  (if  no  payment)”. 
In  order  to  determine  your  best  choice,  you  need  to  know  how  he  will  respond 
to  your  refusal  to  pay.  A lawsuit  is  costly  for  him  as  well  as  for  you;  if  he  is  the 
“tough”  type  he  will  sue  you;  if  he  is  the  “soft”  type  he  will  drop  the  dispute.  If 
you  don’t  know  what  type  he  is,  you  are  in  a situation  of  incomplete 
information. 

As  explained  above,  we  have  a situation  of  incomplete  information  whenever  at 
least  one  of  the  players  does  not  know  some  aspects  of  the  game:  it  could  be  the 
preferences  of  her  opponents,  the  choices  available  to  her  opponents  or  the  set 
of  possible  outcomes.  We  shall  focus  almost  exclusively  on  the  case  where  the 
uncertainty  concerns  the  preferences  of  the  players,  while  everything  else  will 
be  assumed  to  be  common  knowledge  (for  an  exception  see  Exercise  13.2). 
Harsanyi  argued  that  every  situation  can  be  reduced  to  this  case.  For  example, 
he  argued,  if  Player  1 does  not  know  whether  Player  2 has  available  only 
choices  a and  b or  also  choice  c,  we  can  model  this  as  a situation  where  there  are 
two  possible  “states  of  the  world”  in  both  of  which  Player  2 has  three  choices 
available,  a,  b and  c,  but  in  one  of  the  two  states  choice  c gives  an  extremely  low 
payoff  to  Player  2,  so  that  she  would  definitely  not  choose  c. 

The  interactive  knowledge-belief  structures  developed  in  Chapter  8 are 
sufficiently  rich  to  model  situations  of  incomplete  information.  The  states  in 
these  structures  can  be  used  to  express  any  type  of  uncertainty.  In  Chapter  9 we 
interpreted  the  states  in  terms  of  the  actual  choices  of  the  players,  thus 
expressing  uncertainty  in  the  mind  of  a player  about  the  behavior  of  another 
player.  In  that  case  the  game  was  assumed  to  be  common  knowledge  among  the 
players  (that  is,  it  was  the  same  game  at  every  state)  and  what  varied  from  one 
state to another was the choice of at least one player. If we want to represent a
situation  where  one  player  is  not  sure  what  game  she  is  playing,  all  we  have  to 
do  is  interpret  the  states  in  terms  of  games,  that  is,  assign  different  games  to 
different  states. 

We  shall  begin  with  games  in  strategic  form  with  cardinal  payoffs  where  only 
one  player  is  uncertain  about  what  game  is  being  played.  We  shall  use  the 
expression  “one-sided  incomplete  information”  to  refer  to  these  situations. 


13.2  One-sided  incomplete  information 

Let  us  model  the  following  situation:  the  game  being  played  is  the  one  shown 
in  Figure  13.1;  call  this  the  “true”  game. 

[Payoff matrix: Player 1 chooses row T or B; Player 2 chooses column L or R]

Figure 13.1

Player  1 knows  that  she  is  playing  this  game,  while  Player  2 is  uncertain  as  to 
whether  she  is  playing  this  game  or  a different  game,  shown  in  Figure  13.2, 
where  the  payoffs  of  Player  1 are  different: 


[Payoff matrix: Player 1 chooses row T or B; Player 2 chooses column L or R]

Figure 13.2

For convenience, let us refer to the “true game” of Figure 13.1 as the game where Player 1 is of type b and the game of Figure 13.2 as the game where Player 1 is of type a. Suppose that Player 2 assigns probability $\tfrac{2}{3}$ to Player 1 being of type a and probability $\tfrac{1}{3}$ to Player 1 being of type b. The description is not complete yet, because we need to specify whether Player 1 has any uncertainty concerning the beliefs of Player 2; let us assume that the beliefs of Player 2 are common knowledge. We also need to specify whether Player 2 is uncertain about the state of mind of Player 1; let us assume that it is common knowledge that Player 2 knows that Player 1 knows what game is being played. This is a long verbal description! A picture is worth a thousand words and indeed the simple knowledge-belief structure shown in Figure 13.3 captures all of the above. There are two states, $\alpha$ and $\beta$: $\alpha$ is interpreted as a (counterfactual) state where the game of Figure 13.2 is played, while $\beta$ is interpreted as the state where the “true” game of Figure 13.1 is played. We capture the fact that the latter is the “true” game, that is, the game which is actually played, by designating state $\beta$ as “the true state”.¹
[Figure: knowledge-belief structure with two states, $\alpha$ and $\beta$]

Figure 13.3

It is easy to check that, in the knowledge-belief structure of Figure 13.3, at state $\beta$ all of the elements of the verbal description given above are true. Anything that is constant across states is common knowledge: thus the payoffs of Player 2 are common knowledge, the beliefs of Player 2, namely $\begin{pmatrix} \alpha & \beta \\ \tfrac{2}{3} & \tfrac{1}{3} \end{pmatrix}$, are common knowledge and so is the fact that if the game being played is the one of Figure 13.2 (associated with state $\alpha$) then Player 1 knows that they are playing that game and that if the game being played is the one of Figure 13.1 (associated with state $\beta$) then Player 1 knows that they are playing that game, that is, Player 1 knows what game is being played. Note, however, that, at the true state $\beta$, Player 1 knows more than Player 2, namely which of the two games is actually being played.

¹ This is something that is almost never done in the literature, but it is an important element of a description of a situation of incomplete information: what is the actual state of affairs?


Remark  13.1.  In  the  situation  illustrated  in  Figure  13.3,  at  every  state  each 
player  knows  his/her  own  payoffs.  It  may  seem  natural  to  make  this  a 
requirement  of  rationality:  shouldn’t  a rational  player  know  her  own 
preferences?  The  answer  is  Yes  for  preferences  and  No  for  payoffs:  a rational 
player  should  know  how  she  ranks  the  possible  outcomes  (her  preferences  over 
outcomes);  however,  it  is  perfectly  rational  to  be  uncertain  about  what 
outcome  will  follow  a particular  action  and  thus  about  one’s  own  payoff.  For 
example,  you  know  that  you  prefer  accepting  a wrapped  box  containing  a 
thoughtful  gift  (payoff  of  1)  to  accepting  a wrapped  box  containing  an  insulting 
gift  (payoff  of  -1):  you  know  your  preferences  over  these  two  outcomes. 
However,  you  may  be  uncertain  about  the  intentions  of  the  gift-giver  and  thus 
you  are  uncertain  about  what  outcome  will  follow  if  you  accept  the  gift:  you 
don’t  know  if  your  payoff  will  be  1 or  -1.  Thus,  although  you  know  your 
payoff  function,  you  may  be  uncertain  about  what  your  payoff  will  be  if  you 
accept  the  gift.  This  example  is  developed  in  Exercise  13.2. 


Figure  13.3  illustrates  a situation  that  has  to  do  with  games,  but  is  not  a game. 
Harsanyi’s  insight  was  that  we  can  transform  that  situation  into  an  extensive- 
form  game  with  imperfect  information,  as  follows.  We  start  with  a chance 
move,  where  Nature  chooses  the  state;  then  Player  1 is  informed  of  Nature’s 
choice and makes her decision between T and B; Player 2 then makes his choice
between L and R, without being informed of Nature’s choice (to capture his
uncertainty about the game) and without being informed of Player 1’s choice
(to capture the simultaneity of the game). The game is shown in Figure 13.4
below. 
[Figure: extensive-form game in which Nature first chooses Player 1’s type]

Figure 13.4

The  reduction  of  the  situation  illustrated  in  Figure  13.3  to  the  extensive-form 
game  of  Figure  13.4  is  called  the  Harsanyi  transformation.  Once  a game  has  been 
obtained  by  means  of  the  Harsanyi  transformation,  it  can  be  solved  using,  for 
example,  the  notion  of  Nash  equilibrium.  The  Nash  equilibria  of  a game 
corresponding  to  a situation  of  incomplete  information  are  called  Bayesian  Nash 
equilibria.  Note  that  these  are  nothing  more  than  Nash  equilibria:  the  extra 
term  ‘Bayesian’  is  merely  a hint  that  the  game  being  solved  is  meant  to 
represent  a situation  of  incomplete  information. 

Note  that  the  Harsanyi  transformation  involves  some  loss  of  information:  in 
particular,  in  the  resulting  game  one  can  no  longer  tell  what  the  true  state  is, 
that  is,  what  the  actual  game  being  played  is. 

To  find  the  Bayesian  Nash  equilibria  of  the  game  of  Figure  13.4  we  can 
construct  the  corresponding  strategic-form  game,  which  is  shown  in  Figure  13.5 
below.²

² Recall that, when constructing the strategic-form game associated with an extensive-form game with chance moves, payoffs are expected payoffs: see Chapter 6.

Figure 13.5


There are two pure-strategy Bayesian Nash equilibria: $((T,B),R)$ and $((B,T),L)$. How should we interpret them?

Let us consider, for example, the Bayesian Nash equilibrium $((T,B),R)$. This is a Nash equilibrium of the game of Figure 13.4; however, we assumed that the true state was $\beta$, where Player 1 is of type b and thus, in the actual game (the one associated with state $\beta$), the actual play is $(B,R)$, which is not a Nash equilibrium of that game (because, while B is a best reply to R, R is not a best reply to B). This is not surprising, since Player 1 knows that she is playing that game, while Player 2 attaches only probability $\tfrac{1}{3}$ to that game being the actual game. Thus the first observation is that a Bayesian Nash equilibrium of a “game of incomplete information” does not imply that the players play a Nash equilibrium in the actual (or “true”) game. The second observation has to do with Player 1’s strategy. By definition, Player 1’s strategy in the game of Figure 13.4 consists of a pair of choices, one for the case where she is informed that her type is a and the other for the case where she is informed that her type is b: in the Bayesian Nash equilibrium under consideration, Player 1’s strategy $(T,B)$ means “Player 1 plays T if of type a and plays B if of type b”. However, Player 1 knows that she is of type b: she knows what game she is playing. Why, then, should she formulate a plan on how she would play in a counterfactual world where her type was a (that is, in a counterfactual game)? The answer is that Player 1’s strategy $(T,B)$ is best understood not as a contingent plan of action formulated by Player 1, but as a complex object that incorporates (1) the actual choice (namely to play B) made by Player 1 in the game that she knows she is playing and (2) a belief in the mind of Player 2 about what Player 1 would do in the two games that, as far as Player 2 knows, are actual possibilities.

An alternative (and easier) way to find the Bayesian Nash equilibria of the game of Figure 13.4 is to use the notion of weak sequential equilibrium for that game. For example, to verify that $((B,T),L)$ is a Nash equilibrium we can reason as follows. If the strategy of Player 1 is $(B,T)$, then Bayesian updating requires Player 2 to have the following beliefs: probability $\tfrac{2}{3}$ on the second node from the left and probability $\tfrac{1}{3}$ on the third node from the left. Given these beliefs, playing L yields him an expected payoff of $\tfrac{2}{3}(3) + \tfrac{1}{3}(3) = 3$ while playing R yields him an expected payoff of $\tfrac{2}{3}(0) + \tfrac{1}{3}(9) = 3$; thus any strategy (pure or mixed) is sequentially rational: in particular L is sequentially rational. If Player 2 plays L then at her left node Player 1 gets 0 with T and 3 with B, so that B is sequentially rational; at her right node Player 1 gets 6 with T and 3 with B, so that T is sequentially rational. Hence $((B,T),L)$ with the stated beliefs is a weak sequential equilibrium, implying that $((B,T),L)$ is a Nash equilibrium (Theorem 10.1, Chapter 10).
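The verification of $((B,T),L)$ can be mirrored in a few lines of exact arithmetic. This is a minimal sketch that uses only the payoff values quoted in the paragraph above (the full payoff matrices of Figures 13.1 and 13.2 are not needed for this check); the dictionary layout and names are illustrative assumptions.

```python
from fractions import Fraction as F

# Player 2's beliefs implied by Bayesian updating when Player 1 plays (B, T):
# weight 2/3 on "type a played B", weight 1/3 on "type b played T".
belief_type_a, belief_type_b = F(2, 3), F(1, 3)

# Player 2's payoffs at those two nodes, as quoted in the text.
u2_L = belief_type_a * 3 + belief_type_b * 3
u2_R = belief_type_a * 0 + belief_type_b * 9

# Player 1's payoffs against L, by type, as quoted in the text.
u1_against_L = {"a": {"T": 0, "B": 3}, "b": {"T": 6, "B": 3}}
best_reply = {t: max(u, key=u.get) for t, u in u1_against_L.items()}

print(u2_L, u2_R)   # Player 2 is indifferent between L and R
print(best_reply)   # type a prefers B, type b prefers T
```

Player 2's indifference makes L sequentially rational, and each type of Player 1 best-replies to L with the choice prescribed by $(B,T)$.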

Definition  13.1.  In  a “game  of  one-sided  incomplete  information”  a pure- 
strategy  Bayesian  Nash  equilibrium  where  the  “informed”  player  makes  the 
same  choice  at  every  singleton  node  is  called  a pooling  equilibrium,  while  a pure- 
strategy  Bayesian  Nash  equilibrium  where  the  “informed”  player  makes 
different  choices  at  different  nodes  is  called  a separating  equilibrium. 

Thus  in  the  game  of  Figure  13.4  there  are  no  pooling  equilibria:  all  the  pure- 
strategy  Bayesian  Nash  equilibria  are  separating  equilibria. 
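Definition 13.1 can be turned into a small classification rule. The sketch below is illustrative only: it represents the informed player's strategy as a tuple of choices, one per type, and with exactly two types (as in Figure 13.4) "not all the same" coincides with "different choices at different nodes".

```python
def classify(informed_choices):
    """Label a pure strategy of the informed player as pooling or separating."""
    return "pooling" if len(set(informed_choices)) == 1 else "separating"

# The two pure-strategy Bayesian Nash equilibria found for Figure 13.4:
# (Player 1's choice if type a, choice if type b), Player 2's choice.
equilibria = [(("T", "B"), "R"), (("B", "T"), "L")]
labels = {eq: classify(eq[0]) for eq in equilibria}

print(labels)
```

Both equilibria are labelled separating, matching the observation that the game of Figure 13.4 has no pooling equilibria.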
Remark 13.2. Although we have only looked at the case of two players,
situations  of  one-sided  incomplete  information  can  involve  any  number  of 
players,  as  long  as  only  one  player  is  uncertain  about  the  game  being  played 
(while  all  the  other  players  are  not).  An  example  of  a situation  of  one-sided 
incomplete  information  with  three  players  will  be  given  in  the  next  chapter 
with  reference  to  Selten’s  chain-store  game  (which  we  studied  in  Chapter  2, 
Section  2.4). 

We  conclude  this  section  with  an  example  of  a situation  of  one-sided 
incomplete  information  involving  a two-player  game  where  each  player  has  an 
infinite  number  of  strategies.  This  is  an  incomplete-information  version  of 
Cournot’s  game  of  competition  between  two  firms  (Chapter  1,  Section  1.7). 
This  example  involves  the  use  of  calculus  and  some  readers  might  want  to  skip 
it. 

Consider a Cournot duopoly (that is, an industry consisting of two firms, which compete in output levels) with inverse demand given by

$$P(Q) = 34 - Q,$$

where $Q = q_1 + q_2$ is total industry output ($q_1$ is the output of Firm 1 and $q_2$ is the output of Firm 2). It is common knowledge between the two firms that Firm 1’s cost function is given by:

$$C_1(q_1) = 6q_1.$$

Firm 2’s cost function is

$$C_2(q_2) = 9q_2.$$

Firm 2 knows this, while Firm 1 believes that Firm 2’s cost function is $C_2(q_2) = 9q_2$ with probability $\tfrac{1}{3}$ and $C_2(q_2) = 3q_2$ with probability $\tfrac{2}{3}$ (Firm 2 could be a new entrant to the industry using an old technology, or could have just invented a new technology). Firm 1 knows that Firm 2 knows its own cost function. Everything that Firm 1 knows is common knowledge between the two firms. Thus we have the situation of one-sided incomplete information illustrated in Figure 13.6.

Figure 13.6


Let  us  find  a Bayesian  Nash  equilibrium  of  the  game  obtained  by  applying  the 
Harsanyi  transformation  to  this  situation  of  incomplete  information  and 
compare  it  to  the  Nash  equilibrium  of  the  complete  information  game  where 
Firm  2’s  cost  function  is  common  knowledge. 

In the complete information case where it is common knowledge that Firm 2's
cost function is $C_2(q_2) = 9q_2$, the profit (= payoff) functions of the two
firms are given by:

$$\pi_1(q_1, q_2) = [34 - (q_1 + q_2)]\,q_1 - 6q_1,$$

$$\pi_2(q_1, q_2) = [34 - (q_1 + q_2)]\,q_2 - 9q_2.$$

The Nash equilibrium is given by the solution to the following pair of
equations:

$$\frac{\partial \pi_1(q_1, q_2)}{\partial q_1} = 34 - 2q_1 - q_2 - 6 = 0,$$

$$\frac{\partial \pi_2(q_1, q_2)}{\partial q_2} = 34 - q_1 - 2q_2 - 9 = 0.$$

The solution is $q_1 = \tfrac{31}{3} = 10.33$ and $q_2 = \tfrac{22}{3} = 7.33$.
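
The complete-information equilibrium can be checked numerically. The following Python sketch is only an illustration: the function name `cournot_equilibrium` and its parameters are not from the text, and it assumes linear inverse demand P(Q) = a - Q with constant unit costs, as in this example.

```python
from fractions import Fraction

def cournot_equilibrium(a, c1, c2):
    """Solve the two first-order conditions
           a - 2*q1 - q2 - c1 = 0
           a - q1 - 2*q2 - c2 = 0
    for inverse demand P(Q) = a - Q and unit costs c1, c2 (illustrative helper)."""
    a, c1, c2 = Fraction(a), Fraction(c1), Fraction(c2)
    # Substituting one condition into the other gives the closed form below.
    q1 = (a - 2 * c1 + c2) / 3
    q2 = (a - 2 * c2 + c1) / 3
    return q1, q2

q1, q2 = cournot_equilibrium(34, 6, 9)
print(q1, q2)                 # exact fractions
print(float(q1), float(q2))   # decimal approximations
```

With a = 34, c1 = 6 and c2 = 9 the exact outputs are 31/3 and 22/3, matching the values stated above.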

Next we go back to the incomplete-information situation illustrated in Figure
13.6. Although it is not possible to draw the extensive-form game that results
from applying the Harsanyi transformation (because of the infinite number of
possible output levels) we can nevertheless sketch the game as shown in Figure
13.7 (where H means “high cost” and L “low cost”):

Figure 13.7


To find a Bayesian Nash equilibrium it is easiest to think in terms of weak
sequential equilibrium: if the strategy profile $((q_2^H, q_2^L), q_1)$ is part
of a weak sequential equilibrium,3 then, by sequential rationality, $q_2^H$
must maximize the expression $[(34 - q_1 - q_2^H)\,q_2^H - 9q_2^H]$ and $q_2^L$
must maximize the expression $[(34 - q_1 - q_2^L)\,q_2^L - 3q_2^L]$;
furthermore, by Bayesian updating, Firm 1 must assign probability
$\tfrac{1}{3}$ to the node following choice $q_2^H$ and probability
$\tfrac{2}{3}$ to the node following choice $q_2^L$, so that $q_1$ must
maximize the expression
$\tfrac{1}{3}[(34 - q_1 - q_2^H)\,q_1 - 6q_1] + \tfrac{2}{3}[(34 - q_1 - q_2^L)\,q_1 - 6q_1]$.
Thus $(q_2^H, q_2^L, q_1)$ must be a solution to the following system of three
equations:


3 $q_2^H$ is the output chosen by Firm 2 at the left node, where its cost is
high, and $q_2^L$ is the output chosen by Firm 2 at the right node, where its
cost is low.

$$\frac{\partial}{\partial q_2^H}\left[(34 - q_1 - q_2^H)\,q_2^H - 9q_2^H\right] = 0,$$

$$\frac{\partial}{\partial q_2^L}\left[(34 - q_1 - q_2^L)\,q_2^L - 3q_2^L\right] = 0,$$

$$\frac{\partial}{\partial q_1}\left(\tfrac{1}{3}\left[(34 - q_1 - q_2^H)\,q_1 - 6q_1\right] + \tfrac{2}{3}\left[(34 - q_1 - q_2^L)\,q_1 - 6q_1\right]\right) = 0.$$

There is a unique solution given by

$$q_1 = 9, \quad q_2^H = 8, \quad q_2^L = 11.$$

Since the true state is where Firm 2's cost is high, the actual output levels
are $q_1 = 9$ for Firm 1 and $q_2^H = 8$ for Firm 2. Thus in the incomplete
information case Firm 2's output is higher than in the complete information
case (8 instead of 7.33) and Firm 1's output is lower than in the complete
information case (9 instead of 10.33).
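
The three first-order conditions form a linear system that can be solved by substitution. The sketch below is illustrative: the function `bayesian_cournot` and its parameter names are assumptions, and the probability 1/3 attached to the high-cost type is the value consistent with the solution stated in the text.

```python
from fractions import Fraction

def bayesian_cournot(a, c1, c_high, c_low, prob_high):
    """Bayesian Nash equilibrium of the Cournot game with one-sided incomplete
    information about Firm 2's unit cost (illustrative helper).

    First-order conditions:
        Firm 2, high cost:  a - q1 - 2*q2H - c_high = 0
        Firm 2, low cost:   a - q1 - 2*q2L - c_low  = 0
        Firm 1:             a - 2*q1 - E[q2] - c1   = 0
    where E[q2] = prob_high*q2H + (1 - prob_high)*q2L.
    """
    a, c1, c_high, c_low, p = map(Fraction, (a, c1, c_high, c_low, prob_high))
    expected_cost = p * c_high + (1 - p) * c_low
    # Substituting Firm 2's two best responses into Firm 1's condition:
    q1 = (a - 2 * c1 + expected_cost) / 3
    q2_high = (a - c_high - q1) / 2
    q2_low = (a - c_low - q1) / 2
    return q1, q2_high, q2_low

print(bayesian_cournot(34, 6, 9, 3, Fraction(1, 3)))
```

For the numbers of this example the function returns 9 for Firm 1, 8 for the high-cost Firm 2 and 11 for the low-cost Firm 2.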

This is a good time to test your understanding of the concepts introduced
in this section, by going through the exercises in Section 13.E.2 of Appendix
13.E at the end of this chapter.


13.3  Two-sided  incomplete  information 

Let  us  now  consider  a situation  involving  two  players,  both  of  whom  face  some 
uncertainty.  Such  a situation  is  called  a two-sided  incomplete-information 
situation.  Note  that  for  such  a situation  to  arise,  it  is  not  necessary  that  both 
players  are  uncertain  about  some  “objective”  aspect  of  the  game  (such  as  the 
preferences  of  the  opponent):  one  of  the  two  players  might  simply  be  uncertain 
about  the  beliefs  of  the  other  player.  A situation  of  this  sort  is  illustrated  in 
Figure 13.8. Let G be the game associated with states $\alpha$ and $\beta$ (it
is the same game) and G' the game associated with state $\gamma$. Suppose that
the true state is $\alpha$. Then all of the following are true at state $\alpha$:

1)  both  Player  1 and  Player  2 know  that  they  are  playing  game  G (that  is, 
neither  player  has  any  uncertainty  about  the  objective  aspects  of  the 
game), 

2) Player 1 knows that Player 2 knows that they are playing game G,

3) Player 2 is uncertain as to whether Player 1 knows that they are playing
G (which is the case if the actual state is $\alpha$) or whether Player 1 is
uncertain (if the actual state is $\beta$) between the possibility that they
are playing game G and the possibility that they are playing game G' and
considers the two possibilities equally likely; furthermore, Player 2
attaches probability $\tfrac{2}{3}$ to the first case (where Player 1 knows
that they are playing game G) and probability $\tfrac{1}{3}$ to the second
case (where Player 1 is uncertain between game G and game G'),

4)  Player  1 knows  the  state  of  uncertainty  of  Player  2 (concerning  Player  1, 
as  described  in  point  3 above), 

5)  The  payoffs  of  Player  1 are  common  knowledge;  furthermore,  it  is 
common  knowledge  that  Player  2 knows  his  own  payoffs. 


Figure 13.8


In  principle,  the  Harsanyi  transformation  can  be  applied  also  to  situations  of  two- 
sided  incomplete  information.  However,  in  such  cases  there  is  an  issue  concerning 
the  probabilities  with  which  Nature  chooses  the  states.  In  the  case  of  one-sided 
incomplete  information  we  can  take  those  probabilities  to  be  the  beliefs  of  the 
uninformed  player,  but  in  the  case  of  two-sided  incomplete  information  we  have 
two uninformed players and thus two sets of beliefs. For example, if we look at
state $\beta$ in Figure 13.8, we have two different probabilities assigned to
that state: $\tfrac{1}{2}$ by Player 1 and $\tfrac{1}{3}$ by Player 2. Which of
the two should we take as Nature's probability for state $\beta$? The answer
is: neither of them. What we should take as Nature's probabilities is a
probability distribution over the set of states $\{\alpha, \beta, \gamma\}$
that reflects the beliefs of both players: we have encountered such a notion
before, in Chapter 8: it is the notion of a common prior (Definition 8.10). A
common prior is a probability distribution over the set of states that yields
the players' beliefs upon conditioning on the information represented by a cell
of an information partition. For example, in the situation illustrated in
Figure 13.8 we are seeking a probability distribution
$\nu : \{\alpha, \beta, \gamma\} \to [0,1]$ such that

$$\nu(\beta \mid \{\beta, \gamma\}) = \frac{\nu(\beta)}{\nu(\beta) + \nu(\gamma)} = \tfrac{1}{2}, \qquad \nu(\alpha \mid \{\alpha, \beta\}) = \frac{\nu(\alpha)}{\nu(\alpha) + \nu(\beta)} = \tfrac{2}{3}$$

(and, of course, $\nu(\alpha) + \nu(\beta) + \nu(\gamma) = 1$). In this case a
common prior exists and is given by $\nu(\alpha) = \tfrac{1}{2}$ and
$\nu(\beta) = \nu(\gamma) = \tfrac{1}{4}$. Using this common prior to assign
probabilities to Nature's choices we obtain the imperfect-information game
shown in Figure 13.9.
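
The common prior for Figure 13.8 can be recovered mechanically from the two conditional beliefs. The sketch below is only an illustration (the variable and function names are assumptions): it fixes an arbitrary weight for one state, propagates the ratios implied by each player's beliefs, normalizes, and then checks that conditioning on each cell returns those beliefs.

```python
from fractions import Fraction

half, two_thirds = Fraction(1, 2), Fraction(2, 3)

# Unnormalized weights, fixing the weight of state beta at 1.
weight = {"beta": Fraction(1)}
# Player 1's belief: v(beta | {beta, gamma}) = 1/2
weight["gamma"] = weight["beta"] * (1 - half) / half
# Player 2's belief: v(alpha | {alpha, beta}) = 2/3
weight["alpha"] = weight["beta"] * two_thirds / (1 - two_thirds)

total = sum(weight.values())
prior = {state: w / total for state, w in weight.items()}

def conditional(prior, state, cell):
    """Probability of `state` after conditioning the prior on the set `cell`."""
    return prior[state] / sum(prior[s] for s in cell)

print(prior)
print(conditional(prior, "beta", {"beta", "gamma"}))
print(conditional(prior, "alpha", {"alpha", "beta"}))
```

The resulting prior assigns 1/2 to alpha and 1/4 to each of beta and gamma, in line with the text.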


Payoffs (Player 1, Player 2) at the terminal nodes of the game tree:

                    (A,C)    (A,D)    (B,C)    (B,D)
state $\alpha$:      1,3      0,1      0,0      3,1
state $\beta$:       1,3      0,1      0,0      3,1
state $\gamma$:      1,0      0,2      0,0      3,1

Figure 13.9

The following pure-strategy profile is a Bayesian Nash equilibrium of the game
of Figure 13.9: Player 1's strategy is AB (that is, he plays A if informed that
the state is $\alpha$ and plays B if informed that the state is either $\beta$
or $\gamma$) and Player 2's strategy is CD (that is, she plays C at her
information set on the left and D at her information set on the right). To
verify that this is a Bayesian Nash equilibrium it is easier to verify that it
is a weak sequential equilibrium together with the following system of beliefs,
obtained by using Bayesian updating (note that every information set is reached
by the strategy profile): at his information set on the right Player 1 assigns
probability $\tfrac{1}{2}$ to each of $\beta$ and $\gamma$, and at her
information set on the left Player 2 assigns probability $\tfrac{2}{3}$ to
$\alpha$ and $\tfrac{1}{3}$ to $\beta$.

Let us check sequential rationality. We begin with Player 1: (1) at the
singleton node on the left, A gives Player 1 a payoff of 1 (given Player 2's
choice of C) while B gives him a payoff of 0, hence A is sequentially rational;
(2) at the information set on the right, given his beliefs and given Player 2's
strategy CD, choosing A gives him an expected payoff of
$\tfrac{1}{2}\cdot 1 + \tfrac{1}{2}\cdot 0 = \tfrac{1}{2}$ while B gives him an
expected payoff of $\tfrac{1}{2}\cdot 0 + \tfrac{1}{2}\cdot 3 = \tfrac{3}{2}$,
hence B is sequentially rational. Now consider Player 2: (1) at her information
set on the left, given her beliefs, C gives her an expected payoff of
$\tfrac{2}{3}\cdot 3 + \tfrac{1}{3}\cdot 0 = 2$ while D gives her an expected
payoff of $\tfrac{2}{3}\cdot 1 + \tfrac{1}{3}\cdot 1 = 1$, hence C is
sequentially rational; (2) at her information set on the right, given her
beliefs, C gives her a payoff of 0 while D gives her a payoff of 1, hence D is
sequentially rational.
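
The expected-payoff comparisons in the sequential-rationality check can be reproduced in a few lines. This is a sketch only: the helper `expected` and the variable names are assumptions, and the beliefs and payoffs are the ones used in the verification above (1/2, 1/2 for Player 1 at his right information set; 2/3, 1/3 for Player 2 at her left information set).

```python
from fractions import Fraction

def expected(beliefs, payoffs):
    """Expected payoff: sum of belief-weighted payoffs over the nodes."""
    return sum(p * x for p, x in zip(beliefs, payoffs))

p1_beliefs = (Fraction(1, 2), Fraction(1, 2))   # Player 1: states beta, gamma
p2_beliefs = (Fraction(2, 3), Fraction(1, 3))   # Player 2: states alpha, beta

# Player 1 at his information set on the right, against Player 2's strategy CD.
p1_A = expected(p1_beliefs, (1, 0))
p1_B = expected(p1_beliefs, (0, 3))

# Player 2 at her information set on the left, against Player 1's strategy AB.
p2_C = expected(p2_beliefs, (3, 0))
p2_D = expected(p2_beliefs, (1, 1))

print(p1_A, p1_B, p2_C, p2_D)
```

B beats A for Player 1 (3/2 against 1/2) and C beats D for Player 2 (2 against 1), which is what sequential rationality requires of the profile (AB, CD).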

The  existence  of  a common  prior  is  essential  in  order  to  be  able  to  apply  the 
Harsanyi  transformation  to  a situation  of  two-sided  incomplete  information.  In 
some  cases  a common  prior  does  not  exist  (see  Exercise  13.6)  and  thus  the 
Harsanyi  transformation  cannot  be  carried  out. 

Remark  13.3.  Besides  the  conceptual  issues  that  arise  in  general  with  respect  to 
the  notion  of  Nash  equilibrium,  the  notion  of  Bayesian  Nash  equilibrium  for 
games  with  incomplete  information  raises  the  further  issue  of  how  one  should 
understand  or  justify  the  notion  of  a common  prior.  This  issue  is  not  a trivial 
one  and  has  been  the  object  of  debate  in  the  literature.4 

This is a good time to test your understanding of the concepts introduced
in this section, by going through the exercises in Section 13.E.3 of Appendix
13.E at the end of this chapter.


4 See, for example, Bonanno and Nehring (1999), Gul (1998) and Morris (1995).


13.4 Multi-sided incomplete information

So far we have only considered strategic-form games with two players.
However, the analysis extends easily to games involving more than two players.
If there are $n \ge 3$ players and only one player has uncertainty about some
aspects of the game, while the others have no uncertainty whatsoever, then we
have a situation of one-sided incomplete information; if two or more players
have uncertainty (not necessarily all about the game but some possibly about
the beliefs of other players) then we have a multi-sided incomplete-information
situation (the two-sided case being a special case). We will consider an example
of this below. Let $G_1$ and $G_2$ be the three-player games in strategic form
(with cardinal payoffs) shown in Figure 13.10.


Figure 13.10

Consider the multi-sided situation of incomplete information illustrated in
Figure 13.11, where one of the two games of Figure 13.10 is associated with
each state.

Figure 13.11

At the true state $\alpha$, all of the following are true:

• all three players know that they are playing game $G_1$;

• Player 1 knows that Players 2 and 3 know that the actual game is $G_1$;

• Player 2 knows that Player 3 knows that the actual game is $G_1$, but is
uncertain as to whether Player 1 knows or is uncertain; furthermore,
Player 2 assigns probability $\tfrac{2}{3}$ to Player 1 being uncertain;

• Player 3 knows that Player 1 knows that the actual game is $G_1$, but is
uncertain as to whether Player 2 knows or is uncertain; furthermore,
Player 3 assigns probability $\tfrac{2}{5}$ to Player 2 being uncertain;

• The payoffs of Players 1 and 3 are common knowledge.

The beliefs of the players in the situation illustrated in Figure 13.11 are
compatible with each other, in the sense that there exists a common prior. In
fact the following is a common prior:

$$\nu = \begin{pmatrix} \alpha & \beta & \gamma & \delta \\ \tfrac{3}{17} & \tfrac{6}{17} & \tfrac{6}{17} & \tfrac{2}{17} \end{pmatrix}.$$

Thus we can apply the Harsanyi transformation and obtain the game shown in
Figure 13.12.


Figure  13.12 


The  task  of  finding  Bayesian  Nash  equilibria  for  the  game  of  Figure  13.12  is  left 
as  an  exercise  (see  Exercise  13.7). 


This is a good time to test your understanding of the concepts introduced
in this section, by going through the exercises in Section 13.E.4 of Appendix
13.E at the end of this chapter.

Appendix 13.E: Exercises

13.E.2. Exercises for Section 13.2:

One-sided incomplete information.

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 

Exercise  13.1.  Albert  and  Bill  play  a Game  of  Chicken.  Each  player  can  choose 
“swerve”  or  “don’t  swerve.”  Choices  are  made  simultaneously.  If  a player 
swerves  then  he  is  a chicken.  If  he  does  not  swerve,  then  he  is  a rooster  if  the 
other  player  swerves,  but  he  gets  injured  if  the  other  player  does  not.  A player 
can  be  a “normal”  type  or  a “reckless”  type.  A “normal”  type  gets  a payoff  of 
100  from  being  a rooster,  85  from  being  a chicken,  and  zero  from  being  injured. 
A “reckless”  type  thinks  being  a rooster  is  worth  100,  being  injured  is  worth  50, 
but  being  a chicken  is  worth  zero!  As  a matter  of  fact,  both  Albert  and  Bill  are 
normal;  it  is  common  knowledge  between  them  that  Bill  is  normal,  but  Bill 
thinks  that  there  is  a 20%  chance  that  Albert  is  reckless  (and  an  80%  chance 
that  Albert  is  normal).  Bill  knows  that  Albert  knows  whether  he  (=  Albert)  is 
reckless  or  normal.  Everything  that  Bill  knows  is  common  knowledge  between 
Albert  and  Bill. 

(a)  Construct  an  interactive  knowledge-belief  structure  that  represents  the 
situation  of  incomplete  information  described  above. 

(b)  Apply  the  Harsanyi  transformation  to  obtain  an  extensive-form  game. 

(c)  Construct  the  strategic-form  associated  with  the  extensive-form  game  of 
part  (b). 

(d)  Find  the  pure-strategy  Bayesian  Nash  equilibria  and  classify  them  as  either 
pooling  or  separating. 

Exercise  13.2.  Bill  used  to  be  Ann’s  boyfriend.  Today  it  is  Ann’s  birthday.  Bill 
can  either  not  give  a gift  to  Ann  or  give  her  a nicely  wrapped  gift.  If  offered  a 
gift,  Ann  can  either  accept  or  reject.  This  would  be  a pretty  simple  situation,  if 
it  weren’t  for  the  fact  that  Ann  does  not  know  if  Bill  is  still  a friend  or  has 
become  an  enemy.  If  he  is  a friend,  she  expects  a nice  present  from  him.  If  Bill 
is  an  enemy,  she  expects  to  find  a humiliating  thing  in  the  box  (he  is  known  to 
have  given  dead  frogs  to  his  “not  so  nice  friends”  when  he  was  in  third  grade!). 




The preferences are as follows: Bill's favorite outcome (payoff = 1) occurs
when he offers a gift and it is accepted (in either case: if he is a friend, he
enjoys seeing her unwrap a nice present, and if he is an enemy, he revels in the
cruelty of insulting Ann with a humiliating “gift”). Whether he is a friend or
an enemy, Bill prefers having not extended a gift (payoff = 0) to enduring the
humiliation of a rejected gift (payoff = -1). Ann prefers accepting a gift
coming from a friend (payoff = 1) to refusing a gift (payoff = 0); the worst
outcome for her (payoff = -1) is one where she accepts a gift from an enemy.
Ann attaches probability p (with 0 < p < 1) to the event that Bill is a friend
(and 1 - p to Bill being an enemy); however, Ann knows that Bill knows whether
he is a friend or an enemy. Everything that Ann knows is common knowledge
between Ann and Bill. As a matter of fact, Bill is a friend.

(a)  Construct  an  interactive  knowledge-belief  structure  that  represents  the 
situation  of  incomplete  information  described  above. 

(b)  Apply  the  Harsanyi  transformation  to  obtain  an  extensive-form  game. 

(c)  Construct  the  strategic-form  associated  with  the  extensive-form  game. 

(d)  Find  all  the  pure-strategy  Bayesian  Nash  equilibria  and  classify  them  as 
either  pooling  or  separating. 

(e)  Suppose  that  p = Is  the  outcome  associated  with  a pure-strategy 
Bayesian  Nash  equilibrium  Pareto  efficient? 


13.E.3. Exercises for Section 13.3:

Two-sided  incomplete  information. 

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 

Exercise 13.3. Consider the situation of two-sided incomplete information
illustrated below (where the true state is $\alpha$).

Use the Harsanyi transformation to represent this incomplete-information
situation as an extensive-form game. Be explicit about how you calculated the
probabilities for Nature.


Exercise 13.4. Consider the following congestion situation (a variant of the
so-called El Farol Bar Problem).5 Two students at a college are simultaneously
deciding between going to a bar and going home. The bar is extremely small
and it gets congested when more than one person is there. In principle, there
are two types of students. One type, call it the b type, prefers going to the
bar if he is the only customer there (in which case he gets a utility of 20)
but dislikes congestion and gets a utility of -20 if he goes to the bar and is
not the only customer there; furthermore, for the b type, the utility of being
at home is 0. The other type of student, call him the not-b type, prefers being
at home (in which case his utility is 20) to being at the bar; being at the bar
alone gives him a utility of 0 but being at the bar with other customers is
very stressful and gives him a utility of -40. Let $G(b_1, b_2)$ be the game
where both players are b types, $G(b_1, \text{not-}b_2)$ the game where Player 1
is a b type and Player 2 a not-b type, etc. Assume that all payoffs are von
Neumann-Morgenstern payoffs.


(a)  Write  the  four  possible  strategic-form  games. 


5 See https://en.wikipedia.org/wiki/El_Farol_Bar_problem

(b) (b.1) For each of the games of part (a) find the pure-strategy Nash
equilibria.

(b.2) For game $G(b_1, b_2)$ find also a mixed-strategy equilibrium where each
choice is made with positive probability.

(c) Assume now that, as a matter of fact, both players are b types. However, it
is a situation of incomplete information where it is common knowledge
that each player knows his own type but is uncertain about the type of the
other player and assigns probability $\tfrac{1}{5}$ to the other player being
the same type as he is and probability $\tfrac{4}{5}$ to the other player being
of the opposite type. Draw an interactive knowledge-belief structure that
represents this situation of incomplete information.

(d) Use the Harsanyi transformation to represent the above situation of
incomplete information as an extensive-form game.

(e) For the game of part (d), pick one strategy of Player 1 and explain in words
what it means.

(f) For the game of part (d), write the corresponding strategic-form game.

(g) Find a pure-strategy Bayesian Nash equilibrium of the game of part (d).

(h) For the Bayesian Nash equilibrium of part (g), find (h.1) where the players
are actually going and (h.2) the actual payoffs of the players in the game
they are actually playing (that is, at the true state). (h.3) Do their actual
choices yield a Nash equilibrium of the game that they are actually
playing?

(i) If you didn’t know what the true state was but you knew the game of part
(d), what probability would you assign to the event that the players would
end up making actual choices that constitute a Nash equilibrium of the
true game that they are playing?

Exercise 13.5. Consider the situation of two-sided incomplete information
illustrated below (where the true state is $\alpha$).


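State $\alpha$            State $\beta$             State $\gamma$            State $\delta$

       L     R                L     R                L     R                L     R
  T   4,4   0,0          T   0,4   4,0          T   0,4   4,0          T   4,4   0,0
  B   2,0   2,2          B   2,0   0,2          B   2,0   0,2          B   2,0   2,2

(Player 1 chooses the row and Player 2 the column.)

1:   {$\alpha$}     {$\beta$: 1/2, $\gamma$: 1/2}     {$\delta$}

2:   {$\alpha$: 1/4, $\beta$: 3/4}     {$\gamma$: 1/4, $\delta$: 3/4}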

(a)  Use  the  Harsanyi  transformation  to  represent  this  situation  as  an  extensive- 
form  game.  Be  explicit  about  how  you  calculated  the  probabilities  for 
Nature. 

(b)  Write  down  all  the  pure  strategies  of  Player  1 and  all  the  pure  strategies  of 
Player  2. 

(c)  Consider  the  following  pure-strategy  profile:  Player  1 plays  T always  and 
Player  2 plays  L always.  What  belief  system,  paired  with  this  strategy  profile, 
would satisfy Bayesian updating?

13.E.4. Exercises for Section 13.4:

Multi-sided incomplete information.
The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 

Exercise 13.6. Consider the multi-sided situation of incomplete information
shown below (what games $G_1$ and $G_2$ are is irrelevant to the following
question). For what values of p can the Harsanyi transformation be carried out?



Exercise  13.7.  Find  a pure-strategy  Bayesian  Nash  equilibrium  of  the  game  of 
Figure  13.12. 



Exercise  13.8:  Challenging  Question. 

Consider  the  following  situation.  It  is  common  knowledge  between  Players  1 
and  2 that  tomorrow  one  of  three  states  will  occur:  a,  b or  c.  It  is  also  common 
knowledge  between  them  that  (1)  if  state  a materializes,  then  Player  1 will  only 
know  that  either  a or  b occurred  and  Player  2 will  only  know  that  either  a or  c 
occurred,  (2)  if  state  b materializes,  then  Player  1 will  only  know  that  either  a 
or  b occurred  and  Player  2 will  know  that  b occurred,  (3)  if  state  c materializes, 
then  Player  1 will  know  that  c occurred  while  Player  2 will  only  know  that 
either  a or  c occurred.  Tomorrow  they  will  play  the  following  simultaneous 
game:  each  will  report,  confidentially,  one  of  her  two  possible  states  of 
information  to  a third  party  (for  example,  Player  1 can  only  report 
{a,b}  or  {c}).  Note  that  lying  is  a possibility:  for  example,  if  the  state  is  a 
Player 1 can choose to report {c}. Let $R_1$ be the report of Player 1 and
$R_2$ the report of Player 2. The third party, who always knows the true state,
then proceeds as follows:

(1) if the reports are compatible, in the sense that
$R_1 \cap R_2 \neq \emptyset$, then he gives the players the following sums of
money:

                    If the true state is a    If the true state is b    If the true state is c

$R_1 \cap R_2$ =        a     b     c             a     b     c             a     b     c
1 gets                  5     4     6             5     4     4             0     1     ?
2 gets                  5     6     4             0     1     1             5     4     4

(2) if the reports are incompatible ($R_1 \cap R_2 = \emptyset$) then he gives
the players the following sums of money:

The true state is       a     b     c
1 gets                  5     4     7
2 gets                  5     1     4

(a)  Represent  this  situation  of  incomplete  information  by  means  of  an 
interactive  knowledge  structure  (for  the  moment  do  not  worry  about 
beliefs). 

(b)  Apply  the  Harsanyi  transformation  to  the  situation  represented  in  part  (a) 
to  obtain  an  extensive-form  frame  (again,  at  this  stage,  do  not  worry  about 
probabilities).

(c) Suppose first that both players have no idea what the probabilities of the
states are and are not willing to form subjective probabilities. It is common
knowledge that each player is selfish (i.e. only cares about how much
money she herself gets) and greedy (i.e. prefers more money to less) and
ranks sets of outcomes according to the worst outcome, in the sense that
she is indifferent between sets X and Y if and only if the worst outcome in
X is equal to the worst outcome in Y and prefers X to Y if and only if the
worst outcome in X is better than the worst outcome in Y.

(c.1) Write the normal-form (or strategic-form) of the game of Part (a).

(c.2) Find all the pure-strategy Nash equilibria of this game.

(c.3) Among the Nash equilibria, is there one where each player tells the
truth?


(d) Suppose now that it is common knowledge between the players that there
are objective probabilities for the states as follows:

$$\begin{pmatrix} a & b & c \\ \tfrac{2}{5} & \tfrac{1}{5} & \tfrac{2}{5} \end{pmatrix}.$$

This time assume that it is common knowledge that both players are selfish,
greedy and risk-neutral. (Thus ignore now the preferences described in part c.)


(d.1) Suppose that Player 2 expects Player 1 to report truthfully. Is it
rational for Player 2 to also report truthfully?


(d.2) Is “always lying” for each player part of a pure-strategy weak
sequential equilibrium? Prove your claim.

Appendix 13.S: Solutions to exercises


Exercise  13.1.  (a)  The  structure  is  as  follows  (s  means  “swerve”,  ds  means 
“don’t  swerve”): 


Albert is reckless                       Albert is normal

                  Bill                                     Bill
                s         ds                             s         ds
Albert   s     0,85      0,100           Albert   s    85,85     85,100
         ds  100,85     50,0                      ds  100,85      0,0

1 (Albert):   {$\alpha$}   {$\beta$}
2 (Bill):     {$\alpha$: 0.2, $\beta$: 0.8}
true state:   $\beta$

(b) The Harsanyi transformation yields the following game:

[Game tree beginning with a move by Nature]

(c) The corresponding strategic-form game is as follows (Albert's strategy
lists his choice if reckless first and his choice if normal second):

                                             Bill
                   s                                 ds
Albert  s, s     (0.8)85 = 68 , 85                   (0.8)85 = 68 , 100
        s, ds    (0.8)100 = 80 , 85                  0 , (0.2)100 = 20
        ds, s    (0.8)85 + (0.2)100 = 88 , 85        (0.8)85 + (0.2)50 = 78 , (0.8)100 = 80
        ds, ds   100 , 85                            (0.2)50 = 10 , 0

(d) There is only one pure-strategy Bayesian Nash equilibrium, namely
((ds,ds), s), and it is a pooling equilibrium.
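
The strategic form and its equilibria can be checked by brute force. The following Python sketch is illustrative only (all names are assumptions): it builds the ex-ante payoffs from the type-dependent payoffs of Exercise 13.1, with Albert's strategy written as (choice if reckless, choice if normal), and keeps the profiles from which neither player gains by deviating.

```python
from fractions import Fraction
from itertools import product

P_RECKLESS = Fraction(1, 5)      # Bill's belief that Albert is reckless
P_NORMAL = 1 - P_RECKLESS

VALUE = {
    "normal":   {"rooster": 100, "chicken": 85, "injured": 0},
    "reckless": {"rooster": 100, "chicken": 0,  "injured": 50},
}

def payoff(kind, mine, other):
    """Payoff to a player of type `kind` choosing `mine` against `other`."""
    if mine == "s":
        return VALUE[kind]["chicken"]
    return VALUE[kind]["rooster" if other == "s" else "injured"]

def albert(strategy, bill):
    """Albert's ex-ante expected payoff; strategy = (if reckless, if normal)."""
    reckless_choice, normal_choice = strategy
    return (P_RECKLESS * payoff("reckless", reckless_choice, bill)
            + P_NORMAL * payoff("normal", normal_choice, bill))

def bill_payoff(strategy, bill):
    """Bill's expected payoff (Bill is normal) against Albert's strategy."""
    reckless_choice, normal_choice = strategy
    return (P_RECKLESS * payoff("normal", bill, reckless_choice)
            + P_NORMAL * payoff("normal", bill, normal_choice))

ALBERT_STRATEGIES = list(product(("s", "ds"), repeat=2))
BILL_CHOICES = ("s", "ds")

equilibria = [
    (a, b)
    for a, b in product(ALBERT_STRATEGIES, BILL_CHOICES)
    if all(albert(d, b) <= albert(a, b) for d in ALBERT_STRATEGIES)
    and all(bill_payoff(a, d) <= bill_payoff(a, b) for d in BILL_CHOICES)
]
print(equilibria)
```

The only profile that survives is Albert playing ds regardless of type and Bill playing s, the pooling equilibrium reported in part (d).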

Exercise  13.2.  (a)  The  structure  is  as  follows  (g  means  “gift”,  ng  means  “no 
gift”,  a means  “accept”  and  r means  “reject”): 

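Bill is a friend                         Bill is an enemy

                 Ann                                      Ann
               a        r                               a        r
Bill    g     1,1     -1,0               Bill    g     1,-1    -1,0
        ng    0,0      0,0                       ng    0,0      0,0

1 (Bill):     {$\alpha$}   {$\beta$}
2 (Ann):      {$\alpha$: p, $\beta$: 1 - p}
true state:   $\alpha$

(b) The Harsanyi transformation yields the following game:

[Game tree beginning with a move by Nature; terminal payoffs listed for the
friend first and then for the enemy, in the order (g,a), (g,r), ng]

Bill:    1   -1    0      1   -1    0
Ann:     1    0    0     -1    0    0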


(c)  The  corresponding  strategic-form  game  is  as  follows: 

Ann 


a r 


(d) If $p \ge \tfrac{1}{2}$ then there are two pure-strategy Bayesian Nash
equilibria: ((g,g), a) and ((ng,ng), r). Both of them are pooling equilibria.
If $p < \tfrac{1}{2}$ then there is only one Bayesian Nash equilibrium:
((ng,ng), r) (a pooling equilibrium).

(e)  If  p = \ the  only  Bayesian-Nash  equilibrium  is  (( ng,ng ),  r)  and  the  outcome 
is  that  Bill  does  not  offer  a gift  to  Ann.  This  outcome  is  Pareto  inefficient  in 
the  true  game  because,  given  that  the  true  state  of  affairs  is  one  where  Bill  is 
a friend,  a Pareto  superior  outcome  would  be  one  where  Bill  offers  a gift 
and  Ann  accepts. 
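
The dependence of the equilibrium set on p can be explored by enumeration. The sketch below is an illustration under the payoffs stated in Exercise 13.2 (1 for an accepted gift, -1 for a rejected gift, and so on); the function names are assumptions, and Bill's strategy is written as (choice if friend, choice if enemy).

```python
from fractions import Fraction
from itertools import product

def bill_payoff(action, ann):
    """Bill's payoff (the same for both of his types)."""
    if action == "ng":
        return 0
    return 1 if ann == "a" else -1

def ann_payoff(bill_type, action, ann):
    """Ann's payoff: she only gains or loses by accepting a gift."""
    if action == "ng" or ann == "r":
        return 0
    return 1 if bill_type == "friend" else -1

BILL_STRATEGIES = list(product(("g", "ng"), repeat=2))  # (if friend, if enemy)
ANN_CHOICES = ("a", "r")

def pure_bne(p):
    """Pure-strategy Bayesian Nash equilibria when Bill is a friend w.p. p."""
    def bill(s, ann):
        return p * bill_payoff(s[0], ann) + (1 - p) * bill_payoff(s[1], ann)

    def ann_value(s, ann):
        return (p * ann_payoff("friend", s[0], ann)
                + (1 - p) * ann_payoff("enemy", s[1], ann))

    return [
        (s, a)
        for s, a in product(BILL_STRATEGIES, ANN_CHOICES)
        if all(bill(d, a) <= bill(s, a) for d in BILL_STRATEGIES)
        and all(ann_value(s, d) <= ann_value(s, a) for d in ANN_CHOICES)
    ]

print(pure_bne(Fraction(3, 4)))
print(pure_bne(Fraction(1, 4)))
```

For a value of p above one half the search returns both pooling equilibria, while for a value below one half only the no-gift equilibrium remains.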

Exercise 13.4. (a) and (b.1):


$G(b_1, b_2)$:
                       Player 2
                     B            H
Player 1   B     -20, -20       20, 0
           H       0, 20         0, 0

Nash equilibria: (H,B) and (B,H)


$G(b_1, \text{not-}b_2)$:
                       Player 2
                     B            H
Player 1   B     -20, -40       20, 20
           H       0, 0          0, 20

Nash equilibrium: (B,H)


$G(\text{not-}b_1, b_2)$:
                       Player 2
                     B            H
Player 1   B     -40, -20        0, 0
           H      20, 20        20, 0

Nash equilibrium: (H,B)


$G(\text{not-}b_1, \text{not-}b_2)$:
                       Player 2
                     B            H
Player 1   B     -40, -40        0, 20
           H      20, 0         20, 20

Nash equilibrium: (H,H)

(b.2) Let p be the probability that Player 1 plays B and q the probability that
Player 2 plays B. Then the mixed-strategy equilibrium is given by the solution
to $-20q + 20(1 - q) = 0$ and $-20p + 20(1 - p) = 0$, which is
$p = q = \tfrac{1}{2}$.


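(c) The structure is as follows:

States: $(b_1, b_2)$, $(b_1, \text{not-}b_2)$, $(\text{not-}b_1, b_2)$, $(\text{not-}b_1, \text{not-}b_2)$

1:   {$(b_1, b_2)$: 1/5, $(b_1, \text{not-}b_2)$: 4/5}     {$(\text{not-}b_1, b_2)$: 4/5, $(\text{not-}b_1, \text{not-}b_2)$: 1/5}

2:   {$(b_1, b_2)$: 1/5, $(\text{not-}b_1, b_2)$: 4/5}     {$(b_1, \text{not-}b_2)$: 4/5, $(\text{not-}b_1, \text{not-}b_2)$: 1/5}


(d) First of all, note that the common prior is

$$\begin{pmatrix} (b_1, b_2) & (b_1, \text{not-}b_2) & (\text{not-}b_1, b_2) & (\text{not-}b_1, \text{not-}b_2) \\ \tfrac{1}{10} & \tfrac{4}{10} & \tfrac{4}{10} & \tfrac{1}{10} \end{pmatrix}.$$

Thus the extensive form is as follows:

[Game tree beginning with a move by Nature; terminal payoffs listed state by
state, in the order (B,B), (B,H), (H,B), (H,H)]

Player 1:   -20  20   0   0     -20  20   0   0     -40   0  20  20     -40   0  20  20
Player 2:   -20   0  20   0     -40  20   0  20     -20   0  20   0     -40  20   0  20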

(e) One possible strategy for Player 1 is (B,H) which means “if I am the b type
then I go to the bar and if I am the not-b type then I go home”.

(f) The strategic form is as follows:

[4 x 4 matrix in which each player's strategies are B,B; B,H; H,B; H,H]

(g) There is a unique pure-strategy Nash equilibrium, namely ((B,H),(B,H))
where each player goes to the bar if he is a b type and goes home if he is a
not-b type. This can be found either the long way, by filling in all the
payoffs in the above matrix, or by reasoning as follows. For each player, going
home is strictly better than going to the bar if the player is of type not-b,
no matter what she anticipates the other player doing (in other words, H
strictly dominates B at the information set where not-b holds). Thus the
question is what to do if you are of type b. You know that the other player is
going home if he is of type not-b, thus you only need to consider his choice if
he is of type b; if his plan is to go home also in that case, then B gives you
20 for sure and H gives you 0 for sure, hence B is better; if his plan is to go
to the bar, then H gives you 0 for sure while B gives you the lottery
$\begin{pmatrix} -20 & 20 \\ \tfrac{1}{5} & \tfrac{4}{5} \end{pmatrix}$, that
is, an expected payoff of 12; hence B is better in that case too.
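
The "long way" mentioned in part (g) can be delegated to a short program. The sketch below is illustrative only (all names are assumptions): it uses the type-dependent utilities of the exercise and the common prior (1/10, 4/10, 4/10, 1/10), writes each strategy as (choice if b, choice if not-b), and searches the 4 x 4 strategic form for pure-strategy equilibria.

```python
from fractions import Fraction
from itertools import product

def utility(kind, mine, other):
    """Utility of a student of type `kind` ('b' or 'not-b')."""
    if kind == "b":
        if mine == "H":
            return 0
        return 20 if other == "H" else -20
    if mine == "H":
        return 20
    return 0 if other == "H" else -40

TYPES = ("b", "not-b")
PRIOR = {
    ("b", "b"): Fraction(1, 10),
    ("b", "not-b"): Fraction(4, 10),
    ("not-b", "b"): Fraction(4, 10),
    ("not-b", "not-b"): Fraction(1, 10),
}
STRATEGIES = list(product("BH", repeat=2))   # (choice if b, choice if not-b)

def ex_ante(player, s1, s2):
    """Expected payoff of `player` (1 or 2) under strategies s1 and s2."""
    total = Fraction(0)
    for (t1, t2), prob in PRIOR.items():
        a1 = s1[TYPES.index(t1)]
        a2 = s2[TYPES.index(t2)]
        if player == 1:
            total += prob * utility(t1, a1, a2)
        else:
            total += prob * utility(t2, a2, a1)
    return total

equilibria = [
    (s1, s2)
    for s1, s2 in product(STRATEGIES, repeat=2)
    if all(ex_ante(1, d, s2) <= ex_ante(1, s1, s2) for d in STRATEGIES)
    and all(ex_ante(2, s1, d) <= ex_ante(2, s1, s2) for d in STRATEGIES)
]
print(equilibria)

# Expected payoff of a b type who goes to the bar when the other b type does too.
lottery = Fraction(1, 5) * (-20) + Fraction(4, 5) * 20
print(lottery)
```

The search returns the single profile in which each player goes to the bar as a b type and home as a not-b type, and the lottery evaluates to 12 as in the argument above.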


(h) At the true state, both players prefer going to the bar, thus (h.1) they
both end up going to the bar and (h.2) get a payoff of -20. (h.3) (B,B) is not
a Nash equilibrium of the game that they are actually playing (game
$G(b_1, b_2)$).


(i) If the true game is $G(b_1, b_2)$ they end up playing (B,B) which is not a
Nash equilibrium of that game, if the true game is $G(b_1, \text{not-}b_2)$
they end up playing (B,H) which is a Nash equilibrium of that game, if the true
game is $G(\text{not-}b_1, b_2)$ they end up playing (H,B) which is a Nash
equilibrium of that game and if the true game is
$G(\text{not-}b_1, \text{not-}b_2)$ they end up playing (H,H) which is a Nash
equilibrium of that game. Since the probability of $G(b_1, b_2)$ is
$\tfrac{1}{10}$, the probability that they end up playing a Nash equilibrium of
the actual game is $\tfrac{9}{10}$.
Exercise 13.4. The game is as follows:

[Figure: extensive-form game beginning with a move by Nature; the tree and the payoffs are not recoverable.]

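Nature's probabilities are obtained by solving two equations, one for a ratio
with denominator $p_\alpha + p_\beta$ and one for a ratio with denominator
$p_\beta + p_\gamma$ (their numerators and right-hand sides are illegible), together
with $p_\alpha + p_\beta + p_\gamma = 1$.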

Exercise  13.5.  (a)  The  game  is  as  follows: 

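[Figure: extensive-form game beginning with a move by Nature; the tree and the payoffs are not recoverable.]

Nature's probabilities are obtained by solving the following equations:

$\frac{p_\alpha}{p_\alpha+p_\beta} = \frac{1}{4}$, $\frac{p_\beta}{p_\beta+p_\gamma} = \frac{1}{2}$, $\frac{p_\gamma}{p_\gamma+p_\delta} = [\ldots]$ (right-hand side illegible) and $p_\alpha+p_\beta+p_\gamma+p_\delta = 1$.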


(b) Player 1 has 8 strategies: TTT, TTB, TBT, TBB, BTT, BTB, BBT, BBB.
Player 2 has four strategies: LL, LR, RL, RR.


(c) Player 1's beliefs must be [...] at the left node and [...] at the right node of his
middle information set. Player 2's beliefs at her information set on the left
must be: [...] at the left-most node and [...] at the third node from the left. The
same is true for the other information set of Player 2.


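Exercise 13.6. The Harsanyi transformation requires that there be a common
prior. Thus we need a probability distribution $\nu : \{\alpha,\beta,\gamma,\delta\} \to (0,1)$ such that:

(1) $\frac{\nu(\alpha)}{\nu(\alpha)+\nu(\beta)} = \frac{1}{5}$, (2) $\frac{\nu(\gamma)}{\nu(\gamma)+\nu(\delta)} = \frac{2}{3}$, (3) $\frac{\nu(\beta)}{\nu(\beta)+\nu(\gamma)} = \frac{1}{3}$ and (4) $\frac{\nu(\alpha)}{\nu(\alpha)+\nu(\delta)} = p$.

From (1) we get $\nu(\beta) = 4\nu(\alpha)$, from (2) we get $\nu(\gamma) = 2\nu(\delta)$ and from (3) we
get $\nu(\gamma) = 2\nu(\beta)$; these three equalities, together with
$\nu(\alpha)+\nu(\beta)+\nu(\gamma)+\nu(\delta) = 1$, yield a unique solution, namely

$\nu = \begin{pmatrix} \alpha & \beta & \gamma & \delta \\ \frac{1}{17} & \frac{4}{17} & \frac{8}{17} & \frac{4}{17} \end{pmatrix}$.

This is a common prior if and only if
$p = \frac{\nu(\alpha)}{\nu(\alpha)+\nu(\delta)} = \frac{\frac{1}{17}}{\frac{1}{17}+\frac{4}{17}} = \frac{1}{5}$.
Thus $p = \frac{1}{5}$ is the only value that makes it possible to apply the Harsanyi transformation.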
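The arithmetic in Exercise 13.6 can be checked with exact fractions. The following is only a sketch in Python: it assumes that conditions (1), (2) and (3) have right-hand sides 1/5, 2/3 and 1/3 (the values implied by the relations between the weights used in the solution), and the variable names are illustrative.

```python
from fractions import Fraction

# Unnormalized weights, taking v(alpha) = 1 as the unit of measurement.
alpha = Fraction(1)
beta = 4 * alpha    # from (1): v(alpha) / (v(alpha) + v(beta)) = 1/5
gamma = 2 * beta    # from (3): v(beta) / (v(beta) + v(gamma)) = 1/3
delta = gamma / 2   # from (2): v(gamma) / (v(gamma) + v(delta)) = 2/3

# Normalize so that the four probabilities add up to 1.
total = alpha + beta + gamma + delta
prior = {
    "alpha": alpha / total,
    "beta": beta / total,
    "gamma": gamma / total,
    "delta": delta / total,
}

# Condition (4) then determines the only admissible value of p.
p = prior["alpha"] / (prior["alpha"] + prior["delta"])

print(prior)
print(p)
```

Working with `Fraction` rather than floats keeps every intermediate value exact, so the comparison with the fractions in the text is not affected by rounding.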


Exercise 13.7. The game under consideration is reproduced below, with the
equilibrium described below highlighted in red. Given the complexity of the
game, it is definitely not a good idea to construct the corresponding strategic
form. It is best to think in terms of weak sequential equilibrium. Consider the
assessment $(\sigma,\mu)$, highlighted in the figure, where $\sigma = (BBB, CC, EEE)$
(that is, $\sigma$ is the pure-strategy profile where Player 1 chooses B at each of his
three information sets, Player 2 chooses C at each of her two information sets
and Player 3 chooses E at each of her three information sets) and $\mu$ is the
following system of beliefs: Player 1 attaches probability $\frac{1}{2}$ to each node in his
bottom information set; Player 2, at her information set on the left, attaches
probability $\frac{1}{3}$ to the second node from the top and probability $\frac{2}{3}$ to the bottom
node and, at her information set on the right, attaches probability $\frac{1}{3}$ to the
second node from the top and probability $\frac{2}{3}$ to the bottom node; Player 3, at
her top information set, attaches probability [...] to the left-most node and
probability [...] to the second node from the right (the two values are illegible) and, at her bottom-left
information set, attaches probability 1 to the third node from the left and, at
her bottom-right information set, attaches probability 1 to the left-most node.

[Figure: extensive form of the game, with the assessment highlighted; the tree and the payoffs are not recoverable.]


Let us verify that $(\sigma,\mu)$ is a weak sequential equilibrium. The beliefs described
above are obtained from $\sigma$ using Bayesian updating. Thus we only need to
check sequential rationality.

For Player 1: (1) at the top-left node, both A and B give the same payoff
(namely, 4), thus B is sequentially rational; (2) the same is true at the top-right
node; (3) at the bottom information set both A and B give an expected
payoff of $\frac{1}{2}(4) + \frac{1}{2}(4) = 4$, thus B is sequentially rational.

For Player 2: (1) at the left information set C gives an expected payoff of
$\frac{1}{3}(1) + \frac{2}{3}(1) = 1$ while D gives $\frac{1}{3}(0) + \frac{2}{3}(0) = 0$, thus C is sequentially rational;
(2) at the information set on the right C gives an expected payoff of
$\frac{1}{3}(1) + \frac{2}{3}(4) = 3$ while D gives an expected payoff of $\frac{1}{3}(0) + \frac{2}{3}(1) = \frac{2}{3}$, thus C is
sequentially rational.

For Player 3: (1) at the top information set both E and F give an expected
payoff of 0 (her payoff is 0 at each of the two nodes to which she assigns
positive probability), thus E is sequentially rational; (2) at the bottom-left
information set both E and F give a payoff of 0, thus E is sequentially
rational; (3) the same is true at the bottom-right information set.


Exercise 13.8 (Challenging Question). (a) The structure is as follows:

States: a, b, c
Player 1's information partition: {a,b}, {c}
Player 2's information partition: {a,c}, {b}

(b)  The  game  is  as  follows: 
(c) (c.1) The strategic form is as follows (where the strategy (x,y) for Player 1
means x if {a,b} and y if {c} and the strategy (z,w) for Player 2 means z if
{a,c} and w if {b}). Rows are Player 1's strategies and columns are Player 2's
strategies. Inside each cell the sets of outcomes are given:

                 ({a,c},{a,c})          ({a,c},{b})            ({b},{a,c})            ({b},{b})
({a,b},{a,b})    {(5,5),(5,0),(0,5)}    {(5,5),(4,1),(0,5)}    {(4,6),(5,0),(1,4)}    {(4,6),(4,1),(1,4)}
({a,b},{c})      {(5,5),(5,0),(1,4)}    {(5,5),(4,1),(1,4)}    {(4,6),(5,0),(1,4)}    {(4,6),(4,1),(1,4)}
({c},{a,b})      {(6,4),(4,1),(0,5)}    {(6,4),(4,1),(0,5)}    {(5,5),(4,1),(1,4)}    {(5,5),(4,1),(1,4)}
({c},{c})        {(6,4),(4,1),(1,4)}    {(6,4),(4,1),(1,4)}    {(5,5),(4,1),(1,4)}    {(5,5),(4,1),(1,4)}

Taking as payoffs the smallest sum of money in each cell (for the corresponding
player) the game can be written as follows (rows are Player 1's strategies,
columns are Player 2's strategies; in each cell the first number is Player 1's
payoff and the second is Player 2's):

                 ({a,c},{a,c})    ({a,c},{b})    ({b},{a,c})    ({b},{b})
({a,b},{a,b})    0, 0             0, 1           1, 0           1, 1 *
({a,b},{c})      1, 0             1, 1 *         1, 0           1, 1 *
({c},{a,b})      0, 1             0, 1           1, 1 *         1, 1 *
({c},{c})        1, 1 *           1, 1 *         1, 1 *         1, 1 *

(c.2) There are 9 Nash equilibria, which are marked with an asterisk in the table above.

(c.3) Truth telling is represented by the strategy profile
(({a,b},{c}),({a,c},{b})) and it is one of the Nash equilibria.
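The count of nine equilibria in (c.2) can be reproduced by brute force. The sketch below is illustrative Python: the matrix is the reduced game of (c.1), with the last three entries of the first row taken as the minima of the corresponding outcome sets, which is an assumption of this sketch. A cell is a Nash equilibrium when Player 1 cannot gain by changing row and Player 2 cannot gain by changing column.

```python
# Payoff pairs (Player 1, Player 2) of the reduced game of part (c.1).
# Rows are Player 1's strategies, columns are Player 2's strategies.
ROWS = ["({a,b},{a,b})", "({a,b},{c})", "({c},{a,b})", "({c},{c})"]
COLS = ["({a,c},{a,c})", "({a,c},{b})", "({b},{a,c})", "({b},{b})"]
PAYOFFS = [
    [(0, 0), (0, 1), (1, 0), (1, 1)],
    [(1, 0), (1, 1), (1, 0), (1, 1)],
    [(0, 1), (0, 1), (1, 1), (1, 1)],
    [(1, 1), (1, 1), (1, 1), (1, 1)],
]


def pure_nash_equilibria(payoffs):
    """Return every (row, column) index pair that is a pure-strategy Nash equilibrium."""
    n_rows, n_cols = len(payoffs), len(payoffs[0])
    equilibria = []
    for r in range(n_rows):
        for c in range(n_cols):
            u1, u2 = payoffs[r][c]
            # Player 1 compares against every other row in the same column.
            best_for_1 = all(payoffs[k][c][0] <= u1 for k in range(n_rows))
            # Player 2 compares against every other column in the same row.
            best_for_2 = all(payoffs[r][k][1] <= u2 for k in range(n_cols))
            if best_for_1 and best_for_2:
                equilibria.append((r, c))
    return equilibria


equilibria = pure_nash_equilibria(PAYOFFS)
for r, c in equilibria:
    print(ROWS[r], COLS[c])
print(len(equilibria))
```

The truth-telling profile corresponds to the index pair (1, 1), which appears in the list.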

(d) (d.1) No. If the state is b then it is a good idea for Player 2 to report
truthfully because {a,c} yields her 0 while {b} yields her 1. But if the state is
either a or c then, by Bayesian updating, Player 2 must assign probability $\frac{1}{2}$ to
the left-most node and probability $\frac{1}{2}$ to the right-most node of her bottom
information set; thus her expected payoff from reporting {a,c} is
$\frac{1}{2}(5) + \frac{1}{2}(4) = 4.5$ while the expected payoff from reporting {b} is
$\frac{1}{2}(6) + \frac{1}{2}(4) = 5$. Hence reporting truthfully is not sequentially rational at
that information set.

(d.2) Yes. “Always lie” corresponds to the strategy profile
(({c},{a,b}),({b},{a,c})). By Bayesian updating the corresponding beliefs must
be: for Player 1 $(\frac{1}{2},\frac{1}{2})$ and for Player 2 $(0,\frac{1}{2},\frac{1}{2},0)$ at the bottom information set
and (0,1) at the top information set. Sequential rationality is then satisfied at
every information set: for Player 1 at the top information set {c} gives an
expected payoff of $\frac{1}{2}(5) + \frac{1}{2}(4) = \frac{9}{2}$ while {a,b} gives $\frac{1}{2}(4) + \frac{1}{2}(5) = \frac{9}{2}$ and at the
singleton node on the right {a,b} gives 1 and so does {c}; for Player 2 at the
bottom information set {b} gives an expected payoff of $\frac{1}{2}(5) + \frac{1}{2}(4) = 4.5$ and
{a,c} gives $\frac{1}{2}(4) + \frac{1}{2}(5) = 4.5$ and at the top information set both {a,c} and {b}
give 1.
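The expected payoffs for Player 2 in part (d) can be recomputed mechanically. The following Python sketch rests on an assumption: the dictionary `OUTCOME` (a hypothetical name) lists, for each pair of reports and each state, the payoffs read off the cells of the table in (c.1) in which both players always make the same report, taking the three outcomes in each cell in the order of the states a, b, c.

```python
from fractions import Fraction

# OUTCOME[(report_1, report_2)][state] = (payoff of Player 1, payoff of Player 2).
# "ab" stands for the report {a,b}, "ac" for {a,c}, and so on.
# Assumption of this sketch: values taken from the constant-report cells of (c.1).
OUTCOME = {
    ("ab", "ac"): {"a": (5, 5), "b": (5, 0), "c": (0, 5)},
    ("ab", "b"): {"a": (4, 6), "b": (4, 1), "c": (1, 4)},
    ("c", "ac"): {"a": (6, 4), "b": (4, 1), "c": (1, 4)},
    ("c", "b"): {"a": (5, 5), "b": (4, 1), "c": (1, 4)},
}
HALF = Fraction(1, 2)


def player2_expected_payoff(report_2, player1_strategy):
    """Expected payoff of Player 2 at her {a,c} information set.

    She assigns probability 1/2 to state a and 1/2 to state c.
    player1_strategy maps Player 1's information set ("ab" or "c") to his report.
    """
    payoff_in_a = OUTCOME[(player1_strategy["ab"], report_2)]["a"][1]
    payoff_in_c = OUTCOME[(player1_strategy["c"], report_2)]["c"][1]
    return HALF * payoff_in_a + HALF * payoff_in_c


truthful_1 = {"ab": "ab", "c": "c"}  # Player 1 reports his true information
lying_1 = {"ab": "c", "c": "ab"}     # Player 1 always lies

# (d.1): against a truthful Player 1, reporting {b} beats reporting {a,c}.
d1_truth = player2_expected_payoff("ac", truthful_1)
d1_lie = player2_expected_payoff("b", truthful_1)

# (d.2): against a lying Player 1, the two reports are equally good.
d2_truth = player2_expected_payoff("ac", lying_1)
d2_lie = player2_expected_payoff("b", lying_1)

print(d1_truth, d1_lie, d2_truth, d2_lie)
```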
Incomplete Information:

Dynamic  Games 

14.1  One-sided  incomplete  information 

At  the  conceptual  level,  situations  of  incomplete  information  involving 
dynamic  (or  extensive-form)  games  are  essentially  the  same  as 
situations  involving  static  games:  the  only  difference  in  the 
representation  is  that  one  would  associate  with  every  state  a dynamic  game 
instead  of  a static  game.6  As  in  the  case  of  static  games,  we  will  distinguish 
between  one-sided  and  multi-sided  incomplete  information.  In  this  section  we 
will  go  through  two  examples  of  the  former,  while  the  latter  will  be  discussed 
in  Section  14.2. 

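[Figure: a perfect-information frame. Player 1 chooses between T and B; B ends the game with a payoff of 1 for Player 1; after T, Player 2 chooses between D, which gives Player 1 a payoff of 0, and A, which gives Player 1 a payoff of 2. Player 2's payoffs are shown as question marks.]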

Figure  14.1 

6 The reader might have noticed that in Exercise 13.2 we “sneaked in” a dynamic game (where
Bill moved first and decided whether or not to offer a gift and Ann, if offered a
gift, decided whether or not to accept it). However, the game could also have been
written as a simultaneous game, where Ann decided whether or not to accept before knowing
whether Bill would offer a gift (and Bill decided without knowing Ann’s decision).


Consider the perfect-information frame shown in Figure 14.1, where the
question marks in place of Player 2’s payoffs indicate that her payoffs are not
common knowledge between the two players. Suppose that, on the other hand,
Player 1’s payoffs are common knowledge. Furthermore, Player 1 is uncertain
between the two possibilities shown in Figure 14.2 and assigns probability $\frac{1}{3}$ to
the one on the left and probability $\frac{2}{3}$ to the one on the right. The thick edges
represent the backward-induction solutions of the two games.


Figure  14.2 

Thus if Player 1 knew that he was playing the game on the left, then he would
choose B, while if he knew that he was playing the game on the right then he
would choose T. If we assume that Player 1’s beliefs are common knowledge
between the players, then we have the situation of one-sided incomplete
information shown in Figure 14.3. Assume that state α is the true state.


Figure  14.3 
Using the Harsanyi transformation we can convert the situation illustrated in
Figure 14.3 into the extensive-form game shown in Figure 14.4.

[Figure: extensive-form game beginning with a move by Nature.]

Figure  14.4 

The  solution  concept  used  for  the  case  of  static  games  was  Bayesian  Nash 
equilibrium  (that  is,  Nash  equilibrium).  In  the  case  of  dynamic  games  this  is  no 
longer  an  appropriate  solution  concept,  since  - as  we  know  from  Chapter 
3 - it  allows  a player  to  “choose”  a strictly  dominated  action  at  an  unreached 
information  set.  To  see  this,  consider  the  strategic-form  game  associated  with 
the  game  of  Figure  14.4,  shown  in  Figure  14.5. 


                       Player 2
                DD         DA         AD         AA
Player 1   B    1, 0       1, 0       1, 0       1, 0
           T    0, 4/3     4/3, 2     2/3, 1     2, 5/3

(In each cell the first number is Player 1's payoff and the second is Player 2's.)

Figure  14.5 


The Nash equilibria of the strategic-form game shown in Figure 14.5 (and thus
the Bayesian Nash equilibria of the game shown in Figure 14.4) are: (B,DD),
(B,AD) and (T,DA). Of these only (T,DA) is a subgame-perfect equilibrium.
Thus from now on we shall use either the notion of subgame-perfect
equilibrium or the notion of weak sequential equilibrium (in the game of
Figure 14.4 the two notions coincide). Thus we will take (T,DA) to be the
solution of the game of Figure 14.4. Since we postulated that the true game was
the one associated with state α, as a matter of fact the players end up playing
(T,D), which is not the backward-induction solution of the true game. This is
not surprising in light of what was already pointed out in Chapter 13.
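The three Nash equilibria listed for Figure 14.5 can be recovered by checking every cell of the strategic form. The following is a minimal Python sketch; the payoff table is an assumed reading of Figure 14.5 (row B gives (1, 0) in every column, row T gives the fractional payoffs), and the function name is illustrative.

```python
from fractions import Fraction as F

COLUMNS = ["DD", "DA", "AD", "AA"]
# Assumed reading of Figure 14.5: (Player 1's payoff, Player 2's payoff).
PAYOFFS = {
    "B": [(F(1), F(0)), (F(1), F(0)), (F(1), F(0)), (F(1), F(0))],
    "T": [(F(0), F(4, 3)), (F(4, 3), F(2)), (F(2, 3), F(1)), (F(2), F(5, 3))],
}


def pure_nash(payoffs, columns):
    """List the (row, column) pairs from which neither player gains by deviating."""
    found = []
    for row, cells in payoffs.items():
        for j, (u1, u2) in enumerate(cells):
            # Player 1 deviates by switching row within column j.
            no_gain_1 = all(payoffs[other][j][0] <= u1 for other in payoffs)
            # Player 2 deviates by switching column within the same row.
            no_gain_2 = all(v2 <= u2 for (_, v2) in cells)
            if no_gain_1 and no_gain_2:
                found.append((row, columns[j]))
    return found


equilibria = pure_nash(PAYOFFS, COLUMNS)
print(equilibria)
```

The check only identifies Nash equilibria; singling out (T,DA) as subgame perfect requires the extensive form, where Player 2's choice must be optimal in each of the two games.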

Next  we  consider  a more  complex  example,  built  on  Selten’s  Chain-Store  Game 
analyzed  in  Chapter  2 (Section  2.4).  Recall  that  the  story  is  as  follows  (we  will 
consider  the  special  case  where  m,  the  number  of  towns  and  thus  of  potential 
entrants,  is  2).  A chain  store  is  a monopolist  in  an  industry.  It  owns  stores  in 
two  different  towns.  In  each  town  the  chain  store  makes  $5  million  if  left  to 
enjoy  its  privileged  position  undisturbed.  In  each  town  there  is  a 
businesswoman  who  could  enter  the  industry  in  that  town,  but  earns  $1 
million  if  she  chooses  not  to  enter;  if  she  decides  to  enter,  then  the  monopolist 
can  either  fight  the  entrant,  leading  to  zero  profits  for  both  the  chain  store  and 
the  entrant  in  that  town,  or  it  can  accommodate  entry  and  share  the  market 
with  the  entrant,  in  which  case  both  players  make  $1.5  million  in  that  town. 
Thus,  in  each  town  the  interaction  between  the  incumbent  monopolist  and  the 
potential  entrant  is  as  illustrated  in  Figure  14.6  below. 


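[Figure: the potential entrant chooses between in and out. If she chooses out, she gets 1 and the Incumbent gets 5. If she chooses in, the Incumbent chooses between fight, which gives 0 to each of them, and share, which gives 1.5 to each of them.]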


Figure  14.6 


Decisions  are  made  sequentially,  as  follows.  At  date  t (t  = 1,2)  the 
businesswoman  in  town  t decides  whether  or  not  to  enter  and  if  she  enters  then 
the  chain  store  decides  whether  or  not  to  fight  in  that  town.  What  happens  in 
town  t (at  date  t ) becomes  known  to  everybody.  Thus  the  businesswoman  in 
town  2 at  date  2 knows  what  happened  in  town  1 at  date  1 (either  that  there 
was no entry or that entry was met with a fight or that entry was
accommodated). Intuition suggests that in this game the threat by the
Incumbent to fight early entrants might be credible, for the following reason.
The Incumbent could tell Businesswoman 1 the following:

“It is true that, if you enter and I fight, I will make zero
profits, while by accommodating your entry I would make
$1.5 million and thus it would seem that it cannot be in my
interest to fight you. However, somebody else is watching us,
namely Businesswoman 2. If she sees that I have fought your
entry then she might fear that I would do the same with her
and decide to stay out, in which case in town 2 I would
make $5 million, so that my total profits in towns 1 and 2
would be $(0 + 5) = $5 million. On the other hand, if I
accommodate your entry, then she will be encouraged to
enter herself and I will make $1.5 million in each town, for a
total profit of $3 million. Hence, as you can see, it is indeed
in my interest to fight you and thus you should stay out.”

We  showed  in  Chapter  2 that  the  notion  of  backward  induction  does  not 
capture  this  intuition.  In  the  game  depicting  the  entire  interaction  (Figure  2.10, 
Chapter  2)  there  was  a unique  backward-induction  solution  whose 
corresponding  outcome  was  that  both  businesswomen  entered  and  the 
Incumbent  accommodated  entry  in  both  towns.  The  reason  why  the  backward- 
induction  solution  did  not  capture  the  “reputation”  argument  outlined  above 
was  explained  in  Chapter  2.  We  remarked  there  that,  in  order  to  capture  the 
reputation  effect,  one  would  need  to  allow  for  some  uncertainty  in  the  mind  of 
some  of  the  players.  This  is  what  we  will  show  below. 

Suppose that, in principle, there are two types of incumbent monopolists: the
rational type and the hotheaded type. The payoffs of a rational type are as
shown in Figure 14.6, while a hotheaded type enjoys fighting; Figure 14.7
summarizes the payoffs for the two types of Incumbent (while there is only one
type of potential entrant, whose payoffs are as shown earlier).
If the Incumbent is rational:

                         potential entrant
                         in            out
Incumbent    fight       0, 0          5, 1
             share       1.5, 1.5      5, 1

If the Incumbent is hotheaded:

                         potential entrant
                         in            out
Incumbent    fight       2, 0          5, 1
             share       1.5, 1.5      5, 1

(In each cell the first number is the Incumbent's payoff and the second is the potential entrant's.)

Figure  14.7 


Thus  a hotheaded  Incumbent  enjoys  fighting  entry  (he  considers  a fight  to  be  as 
good  as  getting  $2  million!). 

Consider the following situation of one-sided incomplete information. As a
matter of fact, the Incumbent is rational and this fact is common knowledge
between the Incumbent and Potential Entrant 1 (from now on denoted by PE-1).
However, Potential Entrant 2 (PE-2) is uncertain whether the Incumbent is
rational or hotheaded and attaches probability p to the latter case. The beliefs of
PE-2 are common knowledge as are the payoffs of PE-1 and PE-2. This
situation can be illustrated with the knowledge-belief structure of Figure 14.8
below.
Figure 14.8


Applying the Harsanyi transformation to the situation depicted in Figure 14.8
yields the extensive-form game shown in Figure 14.9 below. We want to see if
this situation of one-sided incomplete information can indeed capture the
reputation effect discussed above; in particular, we want to check if there is a
weak sequential equilibrium of the game of Figure 14.9 where the Incumbent’s
strategy would be to fight PE-1’s entry in order to scare off PE-2 and, as a
consequence, PE-1 decides to stay out and so does PE-2.
[Figure: extensive-form game beginning with a move by Nature, which selects the type of the Incumbent; the tree and the payoffs are not recoverable.]

Figure  14.9 

Using  the  notion  of  weak  sequential  equilibrium  allows  us  to  simplify  the 
game,  by  selecting  the  strictly  dominant  choice  for  the  Incumbent  at  each  of  his 
singleton  nodes  followed  only  by  terminal  nodes.  It  is  straightforward  to  check 
that  at  such  nodes  a hotheaded  Incumbent  would  choose  “fight”  while  a 
rational  Incumbent  would  choose  “share”.  Thus  we  can  delete  those  decision 
nodes  and  replace  them  with  the  payoff  vectors  associated  with  the  optimal 
choice.  The  simplified  game  is  shown  in  Figure  14.10  below. 
[Figure: the simplified game, with the Incumbent's last-stage decision nodes replaced by the payoffs of his optimal choice; the tree and the payoffs are not recoverable.]


Figure  14.10 

Consider the following pure-strategy profile, call it $\sigma$, for the simplified game
of Figure 14.10 (it is highlighted in dark pink in Figure 14.10):

(1) PE-1’s strategy is “out” at both nodes,

(2) PE-2’s strategy is “out” at the top information set (after having observed that
PE-1 stayed out), “in” at the middle information set (after having observed
that PE-1’s entry was followed by the Incumbent sharing the market with
PE-1) and “out” at the bottom information set (after having observed that
PE-1’s entry was followed by the Incumbent fighting against PE-1),

(3) the Incumbent’s strategy is to fight entry of PE-1 in any case (that is,
whether the Incumbent himself is hotheaded or rational).

We want to show that $\sigma$, together with the system of beliefs $\mu$ shown in Figure
14.10 (where at her top information set PE-2 assigns probability p to the
Incumbent being hotheaded, at her middle information set she assigns
probability 1 to the Incumbent being rational and at her bottom information
set she assigns probability 1 to the Incumbent being hotheaded), constitutes a
weak sequential equilibrium for any value of $p \geq \frac{1}{3}$. Bayesian updating is
satisfied at PE-2’s top information set (the only information set of PE-2 reached by $\sigma$),
and at the other two information sets of PE-2 any beliefs are allowed by the
notion of weak sequential equilibrium. Sequential rationality also holds:

• For PE-1, at the left node “in” yields 0 and “out” yields 1, so that “out” is
sequentially rational; the same is true at the right node.

• For the Incumbent, at the left node “fight” yields 7 and “share” yields 3.5,
so that “fight” is sequentially rational; at the right node “fight” yields 5 and
“share” yields 3, so that “fight” is sequentially rational.

• For PE-2, at the top information set “in” yields an expected payoff of
$p(0) + (1-p)(1.5)$ and “out” yields 1, so that “out” is sequentially rational as
long as $1 \geq (1-p)(1.5)$, that is, $p \geq \frac{1}{3}$; at the middle information set, given her
belief that the Incumbent is rational, “in” yields 1.5 and “out” yields 1, so that
“in” is sequentially rational; at the bottom information set, given her belief that
the Incumbent is hotheaded, “in” yields 0 and “out” yields 1, so that “out” is
sequentially rational.
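The comparisons in the list above are simple enough to verify numerically. The following Python sketch checks PE-2's choice at her top information set for two values of p and the Incumbent's choice for each type; the function names are illustrative and the payoff numbers are the ones quoted in the text.

```python
from fractions import Fraction


def pe2_prefers_out_at_top(p):
    """PE-2's top information set: True if 'out' is at least as good as 'in'.

    p is the probability that PE-2 assigns to the Incumbent being hotheaded.
    """
    payoff_in = p * 0 + (1 - p) * Fraction(3, 2)
    payoff_out = 1
    return payoff_out >= payoff_in


def incumbent_prefers_fight(fight_payoff, share_payoff):
    """True if fighting PE-1's entry is at least as good as sharing the market."""
    return fight_payoff >= share_payoff


checks = {
    "PE-2 at p = 1/3": pe2_prefers_out_at_top(Fraction(1, 3)),
    "PE-2 at p = 1/4": pe2_prefers_out_at_top(Fraction(1, 4)),
    "hotheaded Incumbent (7 vs 3.5)": incumbent_prefers_fight(7, Fraction(7, 2)),
    "rational Incumbent (5 vs 3)": incumbent_prefers_fight(5, 3),
}
for label, ok in checks.items():
    print(label, ok)
```

At p = 1/3 PE-2 is exactly indifferent between "in" and "out", which is why the condition is a weak inequality; at p = 1/4 entry is strictly better and the profile is no longer sequentially rational.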

Thus the equilibrium described above captures the intuition suggested in
Chapter 2, namely that - even though it is common knowledge between the
Incumbent and PE-1 that the Incumbent is rational and thus would suffer a loss
of 1.5 by fighting PE-1’s entry - it is still credible for the Incumbent to threaten
to fight PE-1’s entry because it would influence the beliefs of PE-2 and induce
her to stay out; understanding the credibility of this threat, it is optimal for
PE-1 to stay out.

The  above  argument  exploits  the  fact  that  the  notion  of  weak  sequential 
equilibrium  allows  for  any  beliefs  whatsoever  at  unreached  information  sets. 
However,  the  beliefs  postulated  for  PE-2  at  her  middle  and  bottom  information 
sets  seem  highly  plausible.  Indeed,  as  shown  in  Exercise  14.2,  the  assessment 
described  above  is  also  a sequential  equilibrium. 

We  now  turn  to  an  example  that  deals  with  the  issue  of  labor-management 
negotiations  and  the  inefficiency  of  strikes.  It  is  not  uncommon  to  observe  a 
firm  and  a union  engaging  in  unsuccessful  negotiations  leading  to  a strike  by 
the  workers,  followed  by  renewed  negotiations  and  a final  agreement.  Strikes 
are  costly  for  the  workers,  in  terms  of  lost  wages,  and  for  the  firm,  in  terms  of 
lost  production.  Why,  then,  don’t  the  parties  reach  the  agreement  at  the  very 
beginning, thus avoiding the inefficiency of a strike? The answer in many cases
has  to  do  with  the  fact  that  there  is  incomplete  information  on  the  side  of  the 
labor  union  and  enduring  a strike  is  the  only  credible  way  for  the  firm  to 
convince  the  union  to  reduce  its  demands.  We  shall  illustrate  this  in  a simple 
example  of  one-sided  incomplete  information. 


Consider the following game between a new firm and a labor union. The union
requests a wage (either high, $w_H$, or low, $w_L$) and the firm can accept or reject.
If the union’s request is accepted, then a contract is signed and production starts
immediately. If the request is rejected, then the union goes on strike for one
period and at the end of that period makes a second, and last, request to the
firm, which the firm can accept or reject. If the firm rejects, then it cannot
enter the industry. When no agreement is signed, both parties get a payoff of 0.
Both firm and union have a discount factor of δ, with 0 < δ < 1, which means
that $1 accrued one period into the future is considered to be equivalent to $δ
at the present time (this captures the cost of waiting and thus the desirability of
avoiding a strike). The extensive-form game is shown in Figure 14.11, where π
denotes the firm’s profit (gross of labor costs).
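[Figure: the Union requests $w_H$ or $w_L$ and the Firm accepts or rejects; after a rejection the Union makes a second request, which the Firm accepts or rejects. A second-period agreement at wage $w$ gives the Union $\delta w$ and the Firm $\delta(\pi - w)$; if no agreement is reached both get 0.]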


Figure  14.11 

Suppose, however, that the union does not know the firm’s expected profit π
(gross of labor costs). It is not unlikely that the management will know what
profits to expect, while the union will not (because, for example, the union does
not have enough information on the intensity of competition, the firm’s non-labor
costs, etc.). Suppose we have a one-sided incomplete information situation where
the union believes that π can have two values: $\pi_H$ (high) and $\pi_L$ (low) and assigns
probability α to $\pi_H$ and (1-α) to $\pi_L$. This situation of one-sided incomplete
information is illustrated in Figure 14.12.
Figure 14.12


Let $\pi_H > \pi_L > 0$ and $w_H > w_L > 0$ (so that H indeed means “high” and L indeed
means “low”). We also assume that $\pi_H - w_H > 0$ (so that the high-profit firm
could in fact afford to pay a high wage), $\pi_L - w_L > 0$ (that is, the low wage is
sufficiently low for the low-profit firm to be able to pay it) and $\pi_L - w_H < 0$ (that is,
the low-profit firm cannot afford to pay a high wage). Finally we assume that the
true state is the one where $\pi = \pi_L$, that is, the firm’s potential profits are in fact
low. These assumptions imply that it is in the interest of both the firm and the
union to sign a contract (even if the firm’s profits are low).

Using  the  Harsanyi  transformation  we  can  convert  the  situation  shown  in  Figure 
14.12  into  the  extensive-form  game  shown  in  Figure  14.13. 
Let us find conditions under which the following strategy profile is part of a
separating weak sequential equilibrium:

(1) The union requests a high wage in the first period.

(2) The high-profit firm accepts while the low-profit firm rejects.

(3) After rejection of the first-period high-wage offer the union requests a low
wage and the firm accepts.

That is, we want an equilibrium where the low-profit firm endures a strike to
signal to the union that its expected profits are low and that it cannot afford to
pay a high wage. The union reacts to the signal by lowering its demand. Of course,
we need to worry about the fact that the high-profit firm might want to
masquerade as a low-profit firm by rejecting the first-period offer (that is, by
sending the same signal as the low-profit firm).

First of all, we can simplify the game by eliminating the second-period choice
of the firm (under our assumptions any second-period offer will be accepted,
except for $w_H$ by the low-profit firm). The simplified game is shown in Figure 14.14.
In order for a low-wage offer to be optimal for the union in Period 2 it is
necessary for the union to assign sufficiently high probability to the firm being
a low-profit one: if p is this probability, then we need $\delta w_L \geq (1-p)\,\delta w_H$, that is,
$p \geq 1 - \frac{w_L}{w_H}$ (for example, p = 1 would be fine). For the high-profit firm not to
have an incentive to send the same signal as the low-profit firm it is necessary
that $\pi_H - w_H \geq \delta(\pi_H - w_L)$, that is, $\delta \leq \frac{\pi_H - w_H}{\pi_H - w_L}$. If this condition is satisfied,
then the high-profit firm will accept $w_H$ in period 1, the low-profit firm will
reject and Bayesian updating requires the union to assign probability 1 to the
right node of its middle information set (that is, to the firm being a low-profit
type), in which case adjusting its demand to $w_L$ is sequentially rational. On the
other hand, requesting a low wage in period 1 will lead to both types of the
firm accepting immediately. So in order for a high-wage offer to be optimal for
the union in period 1 it is necessary that $w_L \leq \alpha w_H + (1-\alpha)\,\delta w_L$, which will be
true if α is sufficiently large, that is, if $\alpha \geq \frac{(1-\delta)\,w_L}{w_H - \delta w_L}$. Finally, given the above
equilibrium choices, the notion of weak sequential equilibrium imposes no
restrictions on the beliefs (and hence choice) of the union at the bottom
information set.


For example, all the above inequalities are satisfied if: $\pi_H = 100$, $\pi_L = 55$,
$w_H = 60$, $w_L = 50$, $\alpha = 0.7$, $\delta = 0.6$. Then $\pi_H - w_H = 40 \geq \delta(\pi_H - w_L) =
0.6(50) = 30$ and $w_L = 50 \leq \alpha w_H + (1-\alpha)\,\delta w_L = (0.7)60 + 0.3(0.6)50 = 51$.
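The numerical example can be checked against all the conditions at once. The following Python sketch is illustrative: the function name and dictionary keys are hypothetical, the inequalities are written in their weak form, and p (the union's second-period belief that the firm is the low-profit type) is set to 1, as Bayesian updating requires in the separating equilibrium.

```python
from fractions import Fraction


def separating_equilibrium_checks(pi_h, pi_l, w_h, w_l, alpha, delta, p=Fraction(1)):
    """Evaluate the maintained assumptions and the three equilibrium inequalities.

    alpha: prior probability of the high-profit firm.
    delta: common discount factor.
    p: union's period-2 belief that the firm is the low-profit type.
    """
    return {
        "assumptions": (pi_h > pi_l > 0 and w_h > w_l > 0
                        and pi_h - w_h > 0 and pi_l - w_l > 0 and pi_l - w_h < 0),
        "period-2 low request": delta * w_l >= (1 - p) * delta * w_h,
        "high-profit firm accepts": pi_h - w_h >= delta * (pi_h - w_l),
        "period-1 high request": w_l <= alpha * w_h + (1 - alpha) * delta * w_l,
    }


alpha = Fraction("0.7")
delta = Fraction("0.6")
checks = separating_equilibrium_checks(
    pi_h=100, pi_l=55, w_h=60, w_l=50, alpha=alpha, delta=delta
)

# Thresholds implied by the two closed-form conditions.
delta_max = Fraction(100 - 60, 100 - 50)          # (pi_H - w_H) / (pi_H - w_L)
alpha_min = (1 - delta) * 50 / (60 - delta * 50)  # (1 - delta) w_L / (w_H - delta w_L)

print(checks)
print(delta_max, alpha_min)
```

With these numbers the discount factor must not exceed 4/5 and the prior on the high-profit type must be at least 2/3, so δ = 0.6 and α = 0.7 satisfy both conditions with room to spare.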


We  conclude  this  section  with  one  more  example  that  has  to  do  with  the 
effectiveness  of  truth-in-advertising  laws. 

Consider  the  case  of  a seller  and  a buyer.  The  seller  knows  the  quality  x of  his 
product,  while  the  buyer  does  not,  although  she  knows  that  he  knows.  The 
buyer  can  thus  ask  the  seller  to  reveal  the  information  he  has.  Suppose  that 
there  is  a truth-in-advertising  law  which  imposes  harsh  penalties  for  false  claims 
in  advertising.  This  is  not  quite  enough,  because  the  seller  can  tell  the  truth 
without  necessarily  revealing  all  the  information.  For  example,  if  x is  the  octane 
content  of  gasoline  and  x = 89,  then  the  following  are  all  true  statements: 

“the  octane  content  of  this  gasoline  is  at  least  70” 

“the  octane  content  of  this  gasoline  is  at  least  85” 
“the octane content of this gasoline is at most 89”

“the  octane  content  of  this  gasoline  is  exactly  89” 
etc. 

Other  examples  are:  (1)  the  fat  content  of  food  (the  label  on  a package  of 
ground  meat  might  read  “not  more  than  30%  fat”),  (2)  fuel  consumption  for 
cars  (a  car  might  be  advertised  as  yielding  “at  least  35  miles  per  gallon”),  (3)  the 
label  on  a mixed-nut  package  might  read  “not  more  than  40%  peanuts”,  etc. 

An  interesting  question  is:  Would  the  seller  reveal  all  the  information  he  has  or 
would  he  try  to  be  as  vague  as  possible  (while  being  truthful)?  In  the  latter  case, 
will  the  buyer  be  in  a worse  position  than  she  would  be  in  the  case  of  complete 
information? 

Milgrom  and  Roberts  (1986)  consider  the  following  game  (resulting  from 
applying  the  Harsanyi  transformation  to  the  situation  of  one-sided  incomplete 
information  described  above).  First  Nature  selects  the  value  of  x,  representing 
the  seller’s  information,  from  a finite  set  X;  as  usual,  the  probabilities  with 
which  Nature  chooses  are  a reflection  of  the  buyer’s  beliefs.  The  seller 
“observes”  Nature’s  choice  and  makes  an  assertion  A to  the  buyer;  A is  a subset 
of X. The seller is restricted to making true assertions, that is, we require that
x ∈ A. The buyer observes the seller’s claim A and then selects a quantity q > 0
to  buy.  Milgrom  and  Roberts  show  that  (under  reasonable  hypotheses),  if  the 
buyer  adopts  a skeptical  view  concerning  the  seller’s  claim,  that  is,  she  always 
interprets  the  seller’s  claim  in  a way  which  is  least  favorable  to  the  seller  (for 
example  “not  more  than  30%  fat”  is  interpreted  as  “exactly  30%  fat”),  then 
there  is  an  equilibrium  where  the  outcome  is  the  same  as  it  would  be  in  the  case 
of  complete  information.  We  shall  illustrate  this  phenomenon  by  means  of  a 
simple  example. 

Suppose that there are three possible quality levels for the good under
consideration: low (l), medium (m) and high (h); thus X = {l,m,h}. The buyer
believes that the three quality levels are equally likely. The buyer has to choose
whether to buy one unit or two units. The seller can only make truthful
claims. For example, if the quality is l, the seller’s possible claims are {l} (full
revelation) or vague claims such as {l,m}, {l,h}, {l,m,h}. The extensive-form
game is shown in Figure 14.15.

[Figure 14.15: extensive-form game of the truth-in-advertising example, starting with a move by NATURE]

It  is  straightforward  to  check  that  the  following  is  a weak  sequential 
equilibrium: 

• The seller claims {l} if Nature chooses l, {m} if Nature chooses m and
{h} if Nature chooses h (that is, the seller reveals the whole truth by
choosing not to make vague claims).

• The buyer buys one unit if told {l}, two units if told {m} or if told {h};
furthermore, the buyer adopts beliefs which are least favorable to the
seller: if told {l,m} or {l,h} or {l,m,h} she will believe l with probability
one and buy one unit, if told {m,h} she will believe m with probability
one and buy two units. All these beliefs are admissible because they
concern information sets that are not reached in equilibrium and
therefore Bayesian updating does not apply.

On the other hand, as shown in Exercise 14.4, if the buyer is naive then there is
an equilibrium where the seller chooses to make vague claims.
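
The skeptical-buyer equilibrium described above can be checked mechanically. The following is a minimal sketch, not part of the original text: the buyer's payoffs are the ones used in the solution to Exercise 14.4, the function names are hypothetical, and the seller is assumed (loudly) to prefer selling more units to fewer, so that only the number of units bought matters to him.

```python
from itertools import combinations

QUALITIES = ["l", "m", "h"]  # ordered from lowest to highest quality

# Buyer's payoff from buying (one unit, two units) at each quality level,
# as used in the expected-payoff computations of the solution to Exercise 14.4.
BUYER_PAYOFF = {"l": (1, 0), "m": (2, 3), "h": (3, 6)}


def truthful_claims(quality):
    """All subsets of X = {l, m, h} that contain the true quality."""
    others = [q for q in QUALITIES if q != quality]
    return [frozenset((quality,) + extra)
            for r in range(len(others) + 1)
            for extra in combinations(others, r)]


def skeptical_units(claim):
    """Units bought by a buyer who believes the lowest quality in the claim."""
    believed = min(claim, key=QUALITIES.index)
    one_unit, two_units = BUYER_PAYOFF[believed]
    return 1 if one_unit > two_units else 2


def full_revelation_is_best_reply():
    """Assumption: the seller's payoff increases with the number of units sold.

    Full revelation is then a best reply if, for every quality, the claim {q}
    induces at least as many units as any other truthful claim.
    """
    return all(
        skeptical_units(frozenset(q)) == max(skeptical_units(c)
                                             for c in truthful_claims(q))
        for q in QUALITIES
    )
```

Calling `full_revelation_is_best_reply()` confirms that no type of the seller can induce a larger purchase by being vague when the buyer is skeptical.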

This is a good time to test your understanding of the concepts introduced
in this section, by going through the exercises in Section 14.E.1 of Appendix
14.E at the end of this chapter.


14.2  Multi-sided  incomplete  information 

The  case  of  situations  of  multi-sided  incomplete  information  involving  dynamic 
games  is  conceptually  the  same  as  the  case  of  multi-sided  situations  of 
incomplete  information  involving  static  games.  In  this  section  we  shall  go 
through  one  example. 

Consider the following situation of two-sided incomplete information. A seller
(player S) owns an item that a buyer (player B) would like to purchase. The
seller’s reservation price is s (that is, she is willing to sell if and only if the price
paid by the buyer is at least s) and the buyer’s reservation price is b (that is, he
is willing to buy if and only if the price is less than or equal to b). It is common
knowledge between the two that (1) both b and s belong to the set {1,2,...,n},
(2) the buyer knows the value of b and the seller knows the value of s, (3) both
the buyer and the seller attach equal probability to all the possibilities among
which they are uncertain.

Buyer and Seller play the following game. First the buyer makes an offer of a
price p ∈ {1,...,n} to the seller. If p = n the game ends and the object is
exchanged for $p. If p < n then the seller either accepts (in which case the game
ends and the object is exchanged for $p) or makes a counter-offer of p' > p, in
which case either the buyer accepts (and the game ends and the object is
exchanged for $p') or the buyer rejects, in which case the game ends without
an exchange. Payoffs are as follows:

\[
\pi_{seller} =
\begin{cases}
0 & \text{if there is no exchange} \\
x - s & \text{if exchange takes place at price } \$x
\end{cases}
\]

\[
\pi_{buyer} =
\begin{cases}
0 & \text{if there is no exchange} \\
b - p & \text{if exchange takes place at price } \$p \text{ (the initial offer)} \\
b - p' - \varepsilon & \text{if exchange takes place at price } \$p' \text{ (the counter-offer)}
\end{cases}
\]

where ε > 0 is a measure of the buyer’s “hurt feelings” for seeing his initial
offer rejected. These are von Neumann-Morgenstern payoffs.

Let us start by focusing on the case n = 2. First we represent the situation
described above by means of an interactive knowledge-belief structure. A
possible state can be written as a pair (b,s), where b is the reservation price of
the buyer and s that of the seller. Thus when n = 2 the possible states are (1,1),
(1,2), (2,1) and (2,2). Figure 14.16 below represents this two-sided situation of
incomplete information.


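[Figure 14.16: interactive knowledge-belief structure over the states (1,1), (1,2), (2,1) and (2,2), with one row for the BUYER and one for the SELLER; each player assigns probability 1/2 to each of the two states that he or she considers possible]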


Next  we  apply  the  Harsanyi  transformation  to  the  situation  illustrated  in 
Figure  14.16.  For  each  state  there  is  a corresponding  extensive-form  game.  The 
four  possible  games  are  shown  in  Figure  14.17  below  (the  top  number  is  the 
Buyer’s  payoff  and  the  bottom  number  the  Seller’s  payoff). 

The extensive-form game that results from applying the Harsanyi
transformation to the situation illustrated in Figure 14.16 is shown in Figure
14.18 below.


[Figure 14.18: NATURE selects one of the states (1,1), (1,2), (2,1) and (2,2), each with probability 1/4; the BUYER offers p = 1 or p = 2; after p = 1 the SELLER either says Yes or counteroffers p' = 2; after a counteroffer the BUYER says Yes or No]


Let  us  find  all  the  pure-strategy  weak  sequential  equilibria  of  the  game  of  Figure 
14.18.  First  of  all,  note  that  at  the  bottom  information  sets  of  the  Buyer,  “Yes” 
is  strictly  dominated  by  “No”  and  thus  a weak  sequential  equilibrium  must 
select  “No”.  Hence  the  game  simplifies  to  the  one  shown  in  Figure  14.19 
below. 

[Figure 14.19: the game of Figure 14.18 after the Buyer's choice of “No” at his bottom information sets has been taken into account]

In the game of Figure 14.19, at the middle information set of the Seller, making a
counteroffer of p' = 2 strictly dominates “Yes”. Thus the game can be further
simplified as shown in Figure 14.20.

[Figure 14.20: the game of Figure 14.19 after the Seller's dominated choice of “Yes” has been removed]


In the game of Figure 14.20, at the left information set of the Buyer p = 1
strictly dominates p = 2. At the right information set the Buyer’s beliefs must
be 1/2 on each node, so that if the Seller’s strategy is to say Yes, then p = 1 is the
only sequentially rational choice there; otherwise both p = 1 and p = 2 are
sequentially rational. Thus the pure-strategy weak sequential equilibria of the
reduced game shown in Figure 14.20 are as follows:

(1) ((p = 1, p = 1), Yes) with beliefs given by probability 1/2 on each node at
every information set.

(2) ((p = 1, p = 1), p' = 2) with beliefs given by probability 1/2 on each
node at every information set.

(3) ((p = 1, p = 2), p' = 2) with beliefs given by (i) probability 1/2 on each
node at both information sets of the Buyer and (ii) probability 1 on the
left node at the information set of the Seller.

These equilibria can be extended to the original game of Figure 14.18 as
follows:

(1) ((p = 1, p = 1, No, No), (p' = 2, Yes)) with beliefs given by (i) probability
1/2 on each node at both information sets of the Buyer at the top and at
both information sets of the Seller and (ii) probability 1 on the right node at
both information sets of the Buyer at the bottom. The corresponding
payoffs are: 1/4 for the Buyer and 0 for the Seller.

(2) ((p = 1, p = 1, No, No), (p' = 2, p' = 2)) with beliefs given by probability
1/2 on each node at every information set. The corresponding payoffs
are 0 for both Buyer and Seller.

(3) ((p = 1, p = 2, No, No), (p' = 2, p' = 2)) with beliefs given by (i)
probability 1/2 on each node at both information sets of the Buyer at
the top and at the lower left information set of the Buyer, (ii) any beliefs
at the lower right information set of the Buyer and (iii) probability 1 on
the left node at each information set of the Seller. The corresponding
payoffs are: 0 for the Buyer and 1/4 for the Seller.
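
The expected payoffs attached to these three equilibria can be recomputed with a short script. The sketch below is only illustrative and rests on stated assumptions: the seller's payoff from an exchange at price x is taken to be x - s and the buyer's payoff from an accepted initial offer b - p, the Buyer is taken to answer No to every counteroffer (as in all three equilibria), and the names `outcome` and `expected_payoffs` are hypothetical.

```python
from fractions import Fraction
from itertools import product

N = 2  # prices and reservation prices range over {1, ..., N}
STATES = list(product(range(1, N + 1), repeat=2))  # states (b, s)
PROB = Fraction(1, len(STATES))  # Nature picks each state with probability 1/4


def outcome(b, s, offer, seller_reply):
    """Return (buyer payoff, seller payoff) in state (b, s).

    offer:        dict mapping the buyer's type b to his initial offer p
    seller_reply: dict mapping the seller's type s to "Yes" or "counter"
    Assumption: the buyer rejects every counteroffer, so a counteroffer
    always ends the game without an exchange.
    """
    p = offer[b]
    if p == N or seller_reply[s] == "Yes":
        return (b - p, p - s)  # exchange at the initial offer
    return (0, 0)  # counteroffer followed by "No"


def expected_payoffs(offer, seller_reply):
    """Expected (buyer, seller) payoffs under Nature's uniform distribution."""
    buyer = sum(PROB * outcome(b, s, offer, seller_reply)[0] for b, s in STATES)
    seller = sum(PROB * outcome(b, s, offer, seller_reply)[1] for b, s in STATES)
    return (buyer, seller)


# The three equilibria, written as (buyer's offers by type, seller's replies by type).
eq1 = ({1: 1, 2: 1}, {1: "Yes", 2: "counter"})
eq2 = ({1: 1, 2: 1}, {1: "counter", 2: "counter"})
eq3 = ({1: 1, 2: 2}, {1: "counter", 2: "counter"})

for name, (offer, reply) in (("(1)", eq1), ("(2)", eq2), ("(3)", eq3)):
    print(name, expected_payoffs(offer, reply))
```

Because the Buyer never accepts a counteroffer in these profiles, the "hurt feelings" parameter plays no role in the computation.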

Now let us consider the case n = 100. Drawing the interactive knowledge-belief
structure and the corresponding extensive-form game (obtained by applying the
Harsanyi transformation) is clearly not practical. However, one can still reason
about what could be a pure-strategy Bayesian Nash equilibrium of that game.
As a matter of fact, there are many Bayesian Nash equilibria. One of them is
the following: (i) each type of the Buyer offers a price equal to his reservation
price, (ii) the Seller accepts if, and only if, that price is greater than or equal to
her reservation price, (iii) at information sets of the Seller that are not reached
the Seller rejects and counteroffers $100, (iv) at information sets of the Buyer
that are not reached the Buyer says “No”. The reader should convince
himself/herself that this is indeed a Nash equilibrium.


This is a good time to test your understanding of the concepts introduced
in this section, by going through the exercises in Section 14.E.2 of Appendix
14.E at the end of this chapter.

Appendix 14.E: Exercises

14.E.1.  Exercises  for  Section  14.1: 

One-sided  incomplete  information 

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 

Exercise 14.1. Consider the following situation of one-sided incomplete
information. Players 1 and 2 are playing the extensive-form game shown below
(where z1, z2, etc. are outcomes and the question mark stands for either
outcome z5 or outcome z6). The outcome that is behind the question mark is
actually outcome z5 and Player 1 knows this, but Player 2 does not know.
Player 2 thinks that the outcome behind the question mark is either z5 or z6
and assigns probability 25% to it being z5 and probability 75% to it being z6.
Player 2 also thinks that whatever the outcome is, Player 1 knows (that is, if it
is z5, then Player 1 knows that it is z5, and if it is z6 then Player 1 knows that
it is z6). The beliefs of Player 2 are common knowledge between the two
players.

[Figure: extensive-form game of Exercise 14.1]


(a)  Represent  this  situation  of  incomplete  information  using  an  interactive 
knowledge-belief  structure. 

(b)  Apply  the  Harsanyi  transformation  to  transform  the  situation  represented 
in  part  (a)  into  an  extensive-form  frame.  [Don’t  worry  about  payoffs  for  the 
moment.] 

From now on assume that the following is common knowledge: (1) both
players have von Neumann-Morgenstern preferences, (2) the ranking of Player
1 is

    best: z4, z6     second: z1     worst: z2, z3, z5

and he is indifferent between z1 for sure and the lottery that gives z6 with
probability 0.5 and z5 with probability 0.5, (3) the ranking of Player 2 is

    best: z6     second: z4     third: z2, z5     worst: z1, z3

and she is indifferent between z4 for sure and the lottery that gives z6 with
probability 0.5 and z3 with probability 0.5, and she is also indifferent between
z2 for sure and the lottery that gives z6 with probability 0.25 and z3 with
probability 0.75.

(c)  Calculate  the  von  Neumann-Morgenstern  normalized  utility  functions  for 
the  two  players. 

(d)  Is  there  a weak  sequential  equilibrium  of  the  game  of  part  (b)  where  Player 
1 always  plays  A (thus  a pooling  equilibrium)?  Explain  your  answer. 

(e)  Is  there  a weak  sequential  equilibrium  of  the  game  of  part  (b)  where  player 
1 always  plays  C (thus  a pooling  equilibrium)?  Explain  your  answer. 

(f)  Is  there  a pure-strategy  weak  sequential  equilibrium  of  the  game  of  part  (b) 
where  Player  1 does  not  always  choose  the  same  action  (thus  a separating 
equilibrium)? 


Exercise 14.2. Show that the assessment highlighted in the following figure,
which reproduces Figure 14.10, is a sequential equilibrium as long as p ≥ 1/3.

[Figure: reproduction of Figure 14.10 with the assessment highlighted]

Exercise 14.3. (a) For the case where p ≤ 1/3 find a pure-strategy weak
sequential equilibrium of the game shown in Exercise 14.2 (which reproduces
Figure 14.10) where PE-1 stays out but PE-2 enters; furthermore, the
Incumbent would fight PE-1 if she entered. Prove that what you propose is a
weak sequential equilibrium.

(b)  Either  prove  that  the  weak  sequential  equilibrium  of  part  (a)  is  a sequential 
equilibrium  or  prove  that  it  is  not  a sequential  equilibrium. 


Exercise  14.4.  Show  that  in  the  “truth-in-advertising”  game  of  Figure  14.15 
there  is  a weak  sequential  equilibrium  where  the  seller  makes  vague  claims 
(that  is,  does  not  reveal  the  whole  truth). 


Exercise  14.5.  Consider  a simpler  version  of  the  “truth-in-advertising”  game, 
where  there  are  only  two  quality  levels:  L and  H.  The  payoffs  are  as  follows: 

• If  the  quality  is  L and  the  buyer  buys  one  unit,  the  seller’s  payoff  is  9 
and  the  buyer’s  payoff  is  1, 

• If  the  quality  is  L and  the  buyer  buys  two  units,  the  seller’s  payoff  is  18 
and  the  buyer’s  payoff  is  0, 

• If  the  quality  is  H and  the  buyer  buys  one  unit,  the  seller’s  payoff  is  6 
and  the  buyer’s  payoff  is  2, 

• If the quality is H and the buyer buys two units, the seller’s payoff is 12
and the buyer’s payoff is 3.

Let the buyer’s initial beliefs be as follows: the good is of quality L with
probability p ∈ (0,1) (and of quality H with probability 1 - p).

(a)  Draw  the  extensive-form  game  that  results  from  applying  the  Harsanyi 
transformation  to  this  one-sided  situation  of  incomplete  information. 

(b) Find the pure-strategy subgame-perfect equilibria of the game of part (a) for
every possible value of p ∈ (0,1).

14.E.2. Exercises for Section 14.2:

Multi-sided incomplete information.

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 

Exercise 14.6. Consider the following situation of two-sided incomplete
information. Players 1 and 2 are having dinner with friends and during dinner
Player 2 insinuates that Player 1 is guilty of unethical behavior. Player 1 can
either demand an apology (D) or ignore (I) Player 2’s remark; if Player 1
demands an apology, then Player 2 can either apologize (A) or refuse to
apologize (not-A); if Player 2 refuses to apologize, Player 1 can either concede
(C) or start a fight (F). Thus the extensive form is as follows (where z1,...,z4 are
the possible outcomes):

[Figure: extensive form of Exercise 14.6, starting with a move by Player 1]

I = ignore, D = demand apology
A = apologize, not-A = not apologize
C = concede, F = fight


Let Ui be the von Neumann-Morgenstern utility function of Player i (i = 1,2).
The following is common knowledge between Players 1 and 2:

U1(z1) = 4, U1(z2) = 2, U1(z3) = 6
U2(z1) = 6, U2(z2) = 8, U2(z3) = 2, U2(z4) = 0.

The only uncertainty concerns the value of U1(z4). As a matter of fact,
U1(z4) = 0; Player 1 knows this, but Player 2 does not. It is common
knowledge between Players 1 and 2 that Player 2 thinks that either U1(z4) = 0
or U1(z4) = 3. Furthermore, as a matter of fact, Player 2 assigns probability 1/4
to U1(z4) = 0 and probability 3/4 to U1(z4) = 3; Player 1 is uncertain as to


whether Player 2’s beliefs are as just described or instead assign probability 1
to U1(z4) = 0, and attaches probability 1/2 to each.


(a)  Construct  a three-state  interactive  knowledge-belief  structure  that  captures 
all  of  the  above. 


(b)  Draw  the  extensive-form  game  that  results  from  applying  the  Harsanyi 
transformation  to  the  situation  of  part  (a). 

(c)  Find  the  pure-strategy  weak  sequential  equilibria  of  the  game  of  part  (b). 



Exercise  14.7:  Challenging  Question. 

There  are  two  parties  to  a potential  lawsuit:  the  owner  of  a chemical  plant  and 
a supplier  of  safety  equipment.  The  chemical  plant  owner,  from  now  on  called 
the  plaintiff,  alleges  that  the  supplier,  from  now  on  called  the  defendant,  was 
negligent  in  providing  the  safety  equipment.  The  defendant  knows  whether  or 
not  he  was  negligent,  while  the  plaintiff  does  not  know;  the  plaintiff  believes 
that  there  was  negligence  with  probability  q.  These  beliefs  are  common 
knowledge  between  the  parties.  The  plaintiff  has  to  decide  whether  or  not  to 
sue.  If  she  does  not  sue  then  nothing  happens  and  both  parties  get  a payoff  of  0. 
If  the  plaintiff  sues  then  the  defendant  can  either  offer  an  out-of-court 
settlement  of  $S  or  resist.  If  the  defendant  offers  a settlement,  the  plaintiff  can 
either  accept  (in  which  case  her  payoff  is  S and  the  defendant’s  payoff  is  -S)  or 
go  to  trial.  If  the  defendant  resists  then  the  plaintiff  can  either  drop  the  case  (in 
which  case  both  parties  get  a payoff  of  0)  or  go  to  trial.  If  the  case  goes  to  trial 
then  legal  costs  are  created  in  the  amount  of  $P  for  the  plaintiff  and  $D  for  the 
defendant.  Furthermore  (if  the  case  goes  to  trial),  the  judge  is  able  to  determine 
if there was negligence and, if there was, requires the defendant to pay $W to
the plaintiff (and each party has to pay its own legal costs), while if there was
no negligence the judge will drop the case without imposing any payments to
either party (but each party still has to pay its own legal costs). It is common
knowledge that each party is “selfish and greedy” (that is, only cares about its
own wealth and prefers more money to less) and is risk neutral.

Assume the following about the parameters:

0 < q < 1, 0 < D < S, 0 < P < S < W - P.

(a)  Represent  this  situation  of  incomplete  information  by  means  of  an 
interactive  knowledge-belief  structure  (the  only  two  players  are  the  plaintiff 
and  the  defendant). 

(b)  Apply  the  Harsanyi  transformation  to  represent  the  situation  in  part  (a)  as 
an  extensive-form  game.  [Don’t  forget  to  subtract  the  legal  expenses  from 
each  party’s  payoff  if  the  case  goes  to  trial.] 

(c)  Write  down  all  the  strategies  of  the  plaintiff. 

(d)  Prove  that  there  is  no  pure-strategy  weak  sequential  equilibrium  which 
(1)  is  a separating  equilibrium  and  (2)  involves  suing. 

(e) For what values of the parameters (q, S, P, W, D) are there pure-strategy
weak sequential equilibria which (1) are pooling equilibria and (2) involve
suing? Consider all types of pooling equilibria and prove your claim.

(f)  For  the  case  where  q = -jV,  P = 70,  S = 80,  W = 100  find  all  the  pure-strategy 
weak  sequential  equilibria  which  (1)  are  pooling  equilibria  and  (2)  involve 
suing.  [Note  that  here  we  have  dropped  the  assumption  that  S < W-P .] 

Appendix 14.S: Solutions to exercises

Exercise 14.1. (a) Let G1 be the game with outcome z5 and G2 the game with
outcome z6. Then the structure is as follows, with q = 1/4. The true state is α.

[Figure: interactive knowledge-belief structure with two states, α and β]

(b) The extensive form is as follows:

[Figure: extensive form starting with a move by Nature]

(c) The normalized utility functions are as follows:

          z1      z2      z3      z4      z5      z6
U1:       0.5     0       0       1       0       1
U2:       0       0.25    0       0.5     0.25    1
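
As a worked step (a sketch that assumes the usual normalization, with utility 1 assigned to a best outcome and utility 0 to a worst outcome), the intermediate values follow from the indifference conditions stated in the exercise by taking expected utilities:

```latex
\begin{align*}
U_1(z_1) &= 0.5\,U_1(z_6) + 0.5\,U_1(z_5) = 0.5(1) + 0.5(0) = 0.5,\\
U_2(z_4) &= 0.5\,U_2(z_6) + 0.5(0) = 0.5(1) = 0.5,\\
U_2(z_2) &= 0.25\,U_2(z_6) + 0.75\,U_2(z_3) = 0.25(1) + 0.75(0) = 0.25.
\end{align*}
```

In the second line the term 0.5(0) stands for the worst-ranked outcome of Player 2's lottery, whose normalized utility is 0.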


Adding these payoffs to the extensive form we obtain the following game:

[Figure: the extensive form of part (b) with the normalized payoffs attached to the terminal nodes]


(d)  No,  because  at  the  right  node  of  Player  1,  C gives  a payoff  of  1 no  matter 
what  Player  2 does  and  thus  Player  1 would  choose  C rather  than  A (which 
only  gives  him  a payoff  of  0.5). 

(e) If Player 1 always plays C (that is, his strategy is CC), then - by Bayesian
updating - Player 2 should assign probability 1/4 to node y and probability 3/4 to
node z, in which case D gives her a payoff of 0.5 while E gives her a payoff of
(1/4)(0.25) + (3/4)(1) = 0.8125; hence she must choose E. But then at his left node
Player 1 with C gets 0 while with A he gets 0.5. Hence choosing C at the left
node is not sequentially rational.
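
The Bayesian-updating step in this argument can be reproduced numerically. The snippet below is a small sketch, not part of the original solution: it takes node y to follow the game with z5 (prior 1/4) and node z the game with z6 (prior 3/4), uses Player 2's normalized utilities from part (c), and the function name `posterior` is hypothetical.

```python
from fractions import Fraction

# Nature's probabilities for the two nodes of Player 2 that follow C.
prior = {"y": Fraction(1, 4), "z": Fraction(3, 4)}

# Under the pooling strategy CC, Player 1 plays C with probability 1 at both nodes.
prob_C = {"y": 1, "z": 1}

# Player 2's payoffs as used in the text: D yields 0.5 at either node,
# E yields 0.25 at node y and 1 at node z.
payoff_D = {"y": Fraction(1, 2), "z": Fraction(1, 2)}
payoff_E = {"y": Fraction(1, 4), "z": Fraction(1)}


def posterior(prior, likelihood):
    """Bayes' rule: beliefs proportional to prior times likelihood."""
    total = sum(prior[n] * likelihood[n] for n in prior)
    return {n: prior[n] * likelihood[n] / total for n in prior}


beliefs = posterior(prior, prob_C)
expected_D = sum(beliefs[n] * payoff_D[n] for n in beliefs)
expected_E = sum(beliefs[n] * payoff_E[n] for n in beliefs)
print(beliefs, float(expected_D), float(expected_E))
```

Since both types of Player 1 play C with the same probability, the posterior coincides with the prior, and E is strictly better than D for Player 2.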

(f) Yes, the following is a weak sequential equilibrium: σ = (AC,E) with
beliefs μ that assign probability 0 to each of the nodes x, y and w and
probability 1 to node z. It is straightforward to verify that sequential
rationality and Bayesian updating are satisfied.

Exercise 14.2. Sequential rationality was verified in Section 14.1. Thus we only
need to show that the highlighted pure-strategy profile together with the
following system of beliefs μ (for the names of the nodes refer to the following
figure) constitutes a consistent assessment (Definition 11.1, Chapter 11):

    μ(s) = p, μ(t) = 1 - p, μ(u) = 0, μ(v) = 1, μ(x) = 1,

with probability 0 on the other node of the information set that contains x.
Let (σ_n), n = 1, 2, ..., be the sequence of completely mixed strategies shown
in the following figure.

[Figure: the game of Figure 14.10 with the sequence of completely mixed strategies marked on the branches]


It is clear that σ_n converges to σ as n tends to infinity. Let μ_n be the system
of beliefs obtained from σ_n by using Bayesian updating. Then

\[
\lim_{n \to \infty} \mu_n(s) = \frac{p}{p + (1-p)} = p, \qquad
\lim_{n \to \infty} \mu_n(u) = 0, \qquad
\lim_{n \to \infty} \mu_n(x) = 1
\]

(for μ_n(x), the numerator tends to p(1) = p and the denominator tends to
p(1) + (1 - p)(0)(1) = p). Thus (σ,μ) is
consistent and sequentially rational and therefore it is a sequential equilibrium.


Exercise 14.3. (a) Consider the following pure-strategy profile, call it σ, which
is highlighted in dark pink in the figure below:

(1) PE-1’s strategy is “out” at both nodes,

(2) PE-2’s strategy is “in” at the top information set (after having observed that
PE-1 stayed out), “in” at the middle information set (after having observed that
PE-1’s entry was followed by the Incumbent sharing the market with PE-1) and
“out” at the bottom information set (after having observed that PE-1’s entry
was followed by the Incumbent fighting against PE-1),

(3) the Incumbent’s strategy is to fight entry of PE-1 in any case (that is,
whether the Incumbent himself is hotheaded or rational).

We want to show that σ, together with the system of beliefs μ shown in the
figure below (where at her top information set PE-2 assigns probability p to the
Incumbent being hotheaded, at her middle information set she assigns
probability 1 to the Incumbent being rational and at her bottom information
set she assigns probability 1 to the Incumbent being hotheaded) constitutes a
weak sequential equilibrium for any value of p ≤ 1/3. Bayesian updating is
satisfied at PE-2’s top information set (the only non-singleton information set
reached by σ), and at the other two information sets of PE-2 any beliefs are
allowed by the notion of weak sequential equilibrium. Sequential rationality
also holds:

• For PE-1, at the left node “in” yields 0 and “out” yields 1, so that “out” is
sequentially rational; the same is true at the right node.

• For the Incumbent, at the left node “fight” yields 7 and “share” yields 3.5,
so that “fight” is sequentially rational; at the right node “fight” yields 5 and
“share” yields 3, so that “fight” is sequentially rational.

• For PE-2, at the top information set “in” yields an expected payoff of
p(0) + (1 - p)(1.5) = (1 - p)(1.5) and “out” yields 1, so that “in” is
sequentially rational as long as p ≤ 1/3; at the middle information set “in”
yields 1.5 and “out” yields 1, so that “in” is sequentially rational; at the
bottom information set “in” yields 0 and “out” yields 1, so that “out” is
sequentially rational.
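
The bound on p at PE-2's top information set comes from a one-line computation, spelled out here as a worked step (it uses only the payoffs quoted in the last bullet):

```latex
\[
(1-p)(1.5) \ge 1
\iff 1 - p \ge \tfrac{2}{3}
\iff p \le \tfrac{1}{3}.
\]
```

At p = 1/3 PE-2 is indifferent between “in” and “out” at her top information set.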


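[Figure: the game of Figure 14.10 with the strategy profile σ, the beliefs μ and a sequence of completely mixed strategies marked]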


(b)  The  assessment  described  in  part  (a)  is  in  fact  a sequential  equilibrium.  This 
can  be  shown  using  the  sequence  of  completely  mixed  strategies  marked  in 
the  figure  above,  which  coincides  with  the  sequences  of  mixed  strategies 
considered  in  Exercise  14.2;  thus  the  calculations  to  show  consistency  are 
identical  to  the  ones  carried  out  in  Exercise  14.2. 

Exercise 14.4. The game under consideration is the following (which
reproduces Figure 14.15):

[Figure: reproduction of Figure 14.15]

Let the buyer be naive, in the sense that at every unreached information set she
assigns equal probability to each node; furthermore, let the buyer’s strategy be
as follows: if told {l} buy one unit, in every other case buy two units. Let the
seller’s strategy be as follows: if Nature chooses l or m claim {l,m} and if
Nature chooses h then claim {h}. Let us verify that this assessment constitutes a
weak sequential equilibrium. The only non-singleton information set that is
reached by the strategy profile is the top information set (where the buyer
hears the claim {l,m}) and Bayesian updating requires the buyer to assign equal
probability to each node. Every other non-singleton information set is not
reached and thus the notion of weak sequential equilibrium allows any beliefs:
in particular beliefs that assign equal probability to each node. Now we check
sequential rationality. Let us start with the buyer. At the singleton node
following claim {l}, buying one unit is optimal and at the singleton nodes
following claims {m} and {h} buying two units is optimal. At the top
information set (after the claim {l,m}) choosing one unit gives an expected
payoff of (1/2)(1) + (1/2)(2) = 1.5 and choosing two units yields
(1/2)(0) + (1/2)(3) = 1.5, thus
buying two units is sequentially rational. At the information set following
claim {l,h} choosing one unit gives an expected payoff of (1/2)(1) + (1/2)(3) = 2 while
choosing two units yields (1/2)(0) + (1/2)(6) = 3, thus buying two units is sequentially
rational. At the information set following claim {l,m,h} choosing one unit gives
an expected payoff of (1/3)(1) + (1/3)(2) + (1/3)(3) = 2 while choosing two units yields
(1/3)(0) + (1/3)(3) + (1/3)(6) = 3, thus buying two units is sequentially rational. Finally, at
the information set following claim {m,h} choosing one unit gives an expected
payoff of (1/2)(2) + (1/2)(3) = 2.5 while choosing two units yields (1/2)(3) + (1/2)(6) = 4.5,
thus buying two units is sequentially rational. Now let us check sequential
rationality for the seller’s strategy. At the left node (after Nature chooses l)
claiming {l} yields a payoff of 9, while every other claim yields a payoff of 18.
Thus {l,m} is sequentially rational. At the other two nodes, the seller is
indifferent among all the claims, because they yield the same payoff (12 at the
node after Nature chooses m and 10 at the node after Nature chooses h); thus
the postulated choices are sequentially rational.
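
The buyer's expected payoffs at the non-singleton information sets can be recomputed as follows. This is a minimal sketch under the naive (uniform) beliefs used in this solution; the buyer's payoffs are read off the computations above, and the function name `expected_buyer_payoffs` is hypothetical.

```python
from fractions import Fraction

# Buyer's payoff from buying (one unit, two units) at each quality level.
BUYER_PAYOFF = {"l": (1, 0), "m": (2, 3), "h": (3, 6)}


def expected_buyer_payoffs(claim):
    """Expected payoffs (one unit, two units) when every quality named in
    the claim is believed to be equally likely (the naive beliefs)."""
    weight = Fraction(1, len(claim))
    one_unit = sum(weight * BUYER_PAYOFF[q][0] for q in claim)
    two_units = sum(weight * BUYER_PAYOFF[q][1] for q in claim)
    return one_unit, two_units


for claim in ("lm", "lh", "lmh", "mh"):
    one_unit, two_units = expected_buyer_payoffs(claim)
    print(set(claim), float(one_unit), float(two_units))
```

In every case buying two units is at least as good as buying one unit, with a tie only after the claim {l,m}.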

Exercise  14.5.  (a)  The  extensive  form  is  as  follows: 


[Figure: extensive form of Exercise 14.5, starting with a move by NATURE]


(b)  At  any  subgame-perfect  equilibrium,  the  buyer  buys  one  unit  at  the 
singleton  information  set  on  the  left  and  two  units  at  the  singleton 
information  set  on  the  right.  Thus  we  can  simplify  the  game  by  replacing 
the  buyer’s  node  on  the  left  with  the  payoff  vector  (9,1)  and  the  buyer’s 
node  on  the  right  with  the  payoff  vector  (12,3).  The  strategic  form 
corresponding  to  the  simplified  game  is  as  follows  (the  buyer’s  strategy 
refers  to  the  buyer’s  only  information  set  that  has  remained,  where  he  is 
told  LH)\ 
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Buyer 


Buy  1 

unit 

Buy  2 units 

L,  LH 

9p  + 6(1 -p), 

p + 2(1  —p) 

18p  + 12(1  — p),  3(  1 — p) 

Seller  L,  H 

9p  + 12(1  — p), 

P + 3(1— p) 

9p  + 12(1  — p),  p + 3(  1 — p) 

LH,  LH 

9p  + 6(1 -p), 

P + 2(1 -p) 

18p  + 12(1  -p),  3(1 -p) 

LH,H 

9p  + 12(1  -p), 

P + 3(1 -p) 

18p  + 12(1  -p),  3(1 -P) 

simplifying: 

Buy  1 unit  Buy  2 units 


L,  LH 

6 + 3p, 

2 - P 

12  + 6p, 

CL 

CO 

I 

CO 

Seller  L,  H 

cl 

CO 

l 

CM 

3 - 2p 

CL 

CO 

1 

CN 

CL 

CN 

1 

CO 

LH,  LH 

6 + 3p, 

2 - P 

12  + 6p, 

CL 

CO 

1 

CO 

LH,H 

CL 

CO 

1 

CM 

3 - 2p 

12  + 6p, 

CL 

CO 

1 

CO 

Since $p < 1$, $12 - 3p > 6 + 3p$. Thus ((LH,H),1) and ((L,H),1) are always Nash
equilibria for every value of p. Note that $2 - p > 3 - 3p$ if and only if
$p > \frac{1}{2}$; thus if $p > \frac{1}{2}$ then there are no other pure-strategy
Nash equilibria. On the other hand, if $p \le \frac{1}{2}$ then the following are
also Nash equilibria: ((L,LH),2) and ((LH,LH),2). To sum up, the pure-strategy
Nash equilibria of this normal form, which correspond to the pure-strategy
subgame-perfect equilibria of the reduced extensive form, are as follows:

• If $p > \frac{1}{2}$: ((LH,H),1) and ((L,H),1);

• If $p \le \frac{1}{2}$: ((LH,H),1), ((L,H),1), ((L,LH),2) and ((LH,LH),2).
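The case distinction on p can be checked mechanically. The following is a minimal sketch, not part of the original solution: it encodes the simplified strategic form of Exercise 14.5 as a dictionary and tests every cell for mutual best responses. The names `payoffs` and `pure_nash` are illustrative, and the sketch assumes 0 < p < 1.

```python
from fractions import Fraction

SELLER = ["L,LH", "L,H", "LH,LH", "LH,H"]   # seller's strategies (row player)
BUYER = [1, 2]                               # units bought at the remaining information set

def payoffs(p):
    """Simplified strategic form: (seller strategy, buyer choice) -> (seller payoff, buyer payoff)."""
    return {
        ("L,LH", 1): (6 + 3 * p, 2 - p),       ("L,LH", 2): (12 + 6 * p, 3 - 3 * p),
        ("L,H", 1): (12 - 3 * p, 3 - 2 * p),   ("L,H", 2): (12 - 3 * p, 3 - 2 * p),
        ("LH,LH", 1): (6 + 3 * p, 2 - p),      ("LH,LH", 2): (12 + 6 * p, 3 - 3 * p),
        ("LH,H", 1): (12 - 3 * p, 3 - 2 * p),  ("LH,H", 2): (12 + 6 * p, 3 - 3 * p),
    }

def pure_nash(p):
    """Return all cells in which neither player gains by a unilateral deviation."""
    u = payoffs(p)
    equilibria = []
    for s in SELLER:
        for b in BUYER:
            seller_ok = all(u[(s, b)][0] >= u[(s2, b)][0] for s2 in SELLER)
            buyer_ok = all(u[(s, b)][1] >= u[(s, b2)][1] for b2 in BUYER)
            if seller_ok and buyer_ok:
                equilibria.append((s, b))
    return equilibria

if __name__ == "__main__":
    for p in (Fraction(1, 4), Fraction(1, 2), Fraction(3, 4)):
        print(p, pure_nash(p))
```

Exact fractions are used so that the boundary case p = 1/2, where the buyer is indifferent, is not distorted by rounding.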
Exercise 14.6. (a) Let G1 and G2 be the following perfect-information games
(whose backward induction solution has been highlighted in dark pink):


The  interactive  knowledge-belief  structure  is  as  follows: 


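[Figure: interactive knowledge-belief structure with states α, β and γ, associated
with games G1, G2 and G1 respectively; the true state is marked]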
(b) The common prior is given by


'a  P y ^ 

ill' 

V 5 5 5 J 


The Harsanyi transformation yields the following game:

[Figure: extensive-form game in which Nature moves first]

(c)  At  the  bottom  information  set  of  Player  1,  C strictly  dominates  F and  thus 
we  can  replace  the  two  nodes  in  that  information  set  with  the  payoff  vector 
(2,8).  At  the  bottom  singleton  node  of  Player  1,  F strictly  dominates  C and 
thus  we  can  replace  that  node  with  the  payoff  vector  (3,0).  Thus,  by 
appealing  to  sequential  rationality,  the  game  can  be  reduced  to  the 
following: 
[Figure: reduced game, with Nature moving first]

Now  at  the  bottom-right  node  of  Player  2,  n-A  strictly  dominates  A and  thus 
we  can  replace  that  node  with  the  payoff  vector  (2,8)  and  further  simplify  the 
game  as  follows: 

[Figure: further simplified game, with Nature moving first]
The following are the pure-strategy weak sequential equilibria of the reduced
game  (which  can  be  extended  into  weak  sequential  equilibria  of  the  original 
game  by  adding  the  choices  that  were  selected  during  the  simplification 
process): 


((/,/),  n-A)  with  beliefs  ju  = 


r u v 

X 

y ] 

i i 

V 2 2 

P 

x~Pj 

for  any  p>\. 


(( l,D ),  A)  with  beliefs  //  = 


(( D,D ),  A)  with  beliefs  //  = 


r U V 

X 

V ^ 

1 1 

V 2 2 

p 

X-Pj 

(u  V 

1 x 

for  any  p<\. 


I i 
\2  2 


1 1 
4 4 7 


Exercise 14.7 (Challenging Question). (a) Let G1 and G2 be the following
games (in G1 the defendant is negligent and in G2 he is not):


[Figures: GAME G1 (defendant is negligent) and GAME G2 (defendant is not negligent)]


Then the situation can be represented as follows (at state α the defendant is
negligent and at state β the defendant is not negligent):

[Figure: interactive knowledge-belief structure over states α and β, associated with
G1 and G2 respectively; the plaintiff attaches probability q to α and (1 - q) to β]

(b)  The  extensive-form  game  is  as  follows: 


[Figure: extensive-form game in which Nature moves first]
(c) Plaintiff’s strategies: (1) (sue; if offer settle; if resist drop), (2) (sue; if offer
settle;  if  resist  go  to  trial),  (3)  (sue;  if  offer  go  to  trial;  if  resist  drop),  (4)  (sue; 
if  offer  go  to  trial;  if  resist  go  to  trial),  (5)  (not  sue;  if  offer  settle;  if  resist 
drop),  (6)  (not  sue;  if  offer  settle;  if  resist  go  to  trial),  (7)  (not  sue;  if  offer  go 
to  trial;  if  resist  drop),  (8)  (not  sue;  if  offer  go  to  trial;  if  resist  go  to  trial). 

(d)  There  are  two  possibilities  for  a separating  equilibrium:  (1)  the  defendant’s 
strategy  is  “resist  if  negligent  and  offer  if  not  negligent”  and 
(2)  the  defendant’s  strategy  is  “offer  if  negligent  and  resist  if  not  negligent”. 
In  both  cases  we  assume  that  the  plaintiff’s  strategy  involves  suing  with 
probability  1. 

Consider  Case  1 first.  By  Bayesian  updating,  at  the  bottom  information  set 
the  plaintiff  must  attach  probability  1 to  the  negligent  type  and  thus,  by 
sequential  rationality,  must  choose  “trial”  (because  W - P > 0).  Similarly, 
by  Bayesian  updating,  at  the  middle  information  set  the  plaintiff  must 
attach  probability  1 to  the  non-negligent  type  and  thus  by  sequential 
rationality  must  choose  “settle”.  But  then  the  negligent  type  of  the 
defendant gets -(W + D) by resisting and would get -S by offering. Since, by
assumption, S < W (< W + D), the choice of resisting is not sequentially
rational. 

Consider  now  Case  2.  By  Bayesian  updating,  at  the  bottom  information  set 
the  plaintiff  must  attach  probability  1 to  the  non-negligent  type  and  thus  by 
sequential  rationality  must  choose  “drop”.  But  then  the  negligent  type  of 
the  defendant  gets  a negative  payoff  by  offering,  while  he  would  get  0 by 
resisting.  Hence  the  choice  of  offering  is  not  sequentially  rational. 

(e)  There  are  two  candidates  for  a pure-strategy  pooling  equilibrium:  (1)  both 
types  of  the  defendant  choose  “offer”  and  (2)  both  types  of  the  defendant 
choose  “resist”. 

Consider Case 1 first (both types of the defendant choose “offer”). In order
for “offer” to be sequentially rational for the non-negligent type, it cannot
be that the plaintiff’s strategy involves “settle” at the middle information set
(the non-negligent type would get either 0 or -D by resisting and both
payoffs are greater than -S) and/or “drop” at the bottom information set.
That is, it must be that the plaintiff chooses “trial” at both information
sets. By Bayesian updating, at the middle information set the plaintiff must
attach probability q to the negligent type and probability (1 - q) to the
non-negligent type. Hence at the middle information set “trial” is sequentially
rational if and only if $qW - P \ge S$, that is, $q \ge \frac{S+P}{W}$. In order
for “trial” to be sequentially rational at the bottom information set, the
plaintiff must attach sufficiently high probability (namely $p \ge \frac{P}{W}$)
to the negligent type. This is allowed by weak sequential equilibrium because
the bottom information set is not reached. Finally, in order for “sue” to be
sequentially rational it must be that $qW - P \ge 0$, that is, $q \ge \frac{P}{W}$,
which is implied by $q \ge \frac{S+P}{W}$. Thus there is a pooling equilibrium
with ((sue, trial, trial), (offer, offer)) if and only if $q \ge \frac{S+P}{W}$.


Now consider Case 2 (both types of the defendant choose “resist”). [Note:
since in part (f) the restriction S < W - P does not hold, we will carry out
the analysis below at first without imposing the restriction.] If the
plaintiff’s strategy involves “drop” at the bottom information set, then it is
indeed sequentially rational for both types of the defendant to choose
“resist”. Furthermore, “drop” is sequentially rational in this case if, and only
if, $qW - P \le 0$, that is, $q \le \frac{P}{W}$. Then “sue” is also sequentially
rational, since the plaintiff’s payoff is 0 no matter whether he sues or does
not sue. Thus there is a pooling equilibrium with ((sue, x, drop), (resist, resist))
if and only if $q \le \frac{P}{W}$ and appropriate beliefs as follows (p is the
probability on the left node of the unreached middle information set):

either x = settle and any p if $W \le S + P$, or $p \le \frac{S+P}{W}$ if $W > S + P$,

or x = trial and $p \ge \frac{S+P}{W}$, which requires $W \ge S + P$ (since $p \le 1$).

Since it is assumed that W > S + P, we can conclude that

((sue, settle, drop), (resist, resist)) is an equilibrium if and only if
$q \le \frac{P}{W}$ with $p \le \frac{S+P}{W}$;

((sue, trial, drop), (resist, resist)) is an equilibrium if and only if
$q \le \frac{P}{W}$ with $p \ge \frac{S+P}{W}$.
If, on the other hand, $q \ge \frac{P}{W}$, then “trial” is sequentially rational
at the bottom information set. Then, in order for the non-negligent type of the
defendant to choose “resist” it must be that the plaintiff’s strategy involves
“trial” also at the middle information set, for which we need him to assign
probability $p \ge \frac{S+P}{W}$ to the negligent type (which is possible, since
the middle information set is not reached); of course, this requires $W \ge S + P$.
Thus,

((sue, trial, trial), (resist, resist)) is an equilibrium if and only if
$q \ge \frac{P}{W}$ with $p \ge \frac{S+P}{W}$.

(f)  Note  that  here  the  restriction  W - P > S does  not  hold.  In  this  case, 
q < ^r-  = | and  thus,  by  the  previous  analysis,  there  is  no  pooling 

equilibrium  of  type  (1),  that  is,  ((sue, trial, trial), (offer, offer))  is  not  an 
equilibrium. 

Also,  since  S + P>W  there  is  no  pooling  equilibrium  of  type  (2),  that  is, 
neither  ((sue, trial, trial), (resist, resist))  nor  ((sue, trial, drop), (resist, resist))  is  an 
equilibrium. 

However,  there  is  a pooling  equilibrium  of  type  (2)  with 
((sue, settle, drop), (resist, resist))  with  any  beliefs  at  the  middle  information 
set,  since  “settle”  strictly  dominates  “trial”  there  (and,  of  course,  belief 
q = rjV  on  the  left  node  of  the  bottom  information  set). 
Incomplete Information: The Type-Space Approach

15.1  Types  of  players 

As noted in Chapter 13, the theory of “games of incomplete
information”7 was pioneered by John Harsanyi (1967-68). Harsanyi’s
theory was developed using a different approach from the one we
employed in Chapters 13 and 14, which is based on the interactive knowledge-belief
structures introduced in Chapter 8. The interactive knowledge structures
of Chapter 8 are in fact a special case of the more general notion of interactive
Kripke structure, named after the philosopher and logician Saul Kripke, whose
work on this goes back to 1959 and was written while he was still an
undergraduate.8 Although well known among logicians and computer
scientists, these structures were not known to game theorists. Perhaps, if
Harsanyi had been aware of Kripke’s work he might have developed his theory
using those structures. We find the interactive Kripke structures more natural
and elegant and thus prefer to explain the “theory of games of incomplete
information” using those structures. In this chapter we will explain the
“type-space” approach developed by Harsanyi and show that the two approaches are
equivalent. We will limit ourselves to situations of incomplete information


7 We  use  quotation  marks  because,  strictly  speaking,  there  is  no  such  thing  as  a game  of 
incomplete  information.  There  are  situations  of  incomplete  information  involving  the 
playing  of  a game  and  what  Harsanyi  did  was  to  suggest  a way  of  transforming  such 
situations  into  extensive-form  games  with  imperfect  information.  Once  the  so-called 
“Harsanyi  transformation”  has  been  applied,  the  resulting  game  is  a game  of  complete 
information.  Thus  the  “theory  of  games  of  incomplete  information”  is  a theory  on  how  to 
represent  a situation  of  incomplete  information  concerning  the  playing  of  a game  and  how  to 
transform  it  into  a dynamic  game  with  imperfect,  but  complete,  information. 

8 Kripke  (1959,  1963).  For  further  details  the  reader  is  referred  to  van  Ditmarsch  et  al.  (2015). 
involving static games. We will begin in the next section with a simpler special
case,  which  is  often  all  that  one  finds  in  game-theory  textbooks,  and  then 
explain  the  general  case  in  Section  15.3. 


15.2  Types  that  know  their  own  payoffs 

We take as a starting point the case of incomplete information concerning a
strategic-form game-frame $\langle I, (S_1,\dots,S_n), O, f \rangle$.9 In the special
case considered in this section, any uncertainty Player i has (if any) concerns
either the von Neumann-Morgenstern utility function $U_j : O \to \mathbb{R}$
($j \neq i$) of another player (or several other players) or the beliefs of the
other player(s) (or both). Within the approach of Chapter 13, we would represent
such a situation with an interactive knowledge-belief structure,10 by associating
with every state ω a game based on the given game-frame.11 Let
$U_{i,\omega} : O \to \mathbb{R}$ be the utility function of Player i at state ω.
The assumption that every player knows her own payoffs is captured by the
following restriction:

if $\omega, \omega' \in \Omega$ and $\omega' \in I_i(\omega)$ then $U_{i,\omega'} = U_{i,\omega}$.

An example of such a situation of incomplete information is given in Figure
15.1, which reproduces Figure 13.3.12


9 Recall (Definition 1.1, Chapter 1) that the elements of $\langle I, (S_1,\dots,S_n), O, f \rangle$
are as follows: $I = \{1,\dots,n\}$ is a set of players, $S_i$ is the set of strategies of
Player $i \in I$ (and $S = S_1 \times \dots \times S_n$ is the set of strategy profiles), O is a
set of outcomes and $f : S \to O$ is a function that associates with every strategy
profile s an outcome $f(s) \in O$.

10 Recall that an interactive knowledge-belief structure consists of a set of states Ω,
n partitions $\mathcal{I}_1,\dots,\mathcal{I}_n$ of Ω and a collection of probability
distributions on Ω, one for each information set of each partition, whose support is
a subset of that information set. Recall also that, for every state $\omega \in \Omega$,
we denote by $I_i(\omega)$ the information set of Player i (that is, the element of the
partition $\mathcal{I}_i$) that contains state ω.

11 Recall that, given a game-frame $\langle I, (S_1,\dots,S_n), O, f \rangle$, a game based on
it is obtained by specifying, for every Player i, a von Neumann-Morgenstern utility
function $U_i : O \to \mathbb{R}$ on the set of outcomes.

12 In this case the game-frame is given by: $I = \{1,2\}$, $S_1 = \{T,B\}$, $S_2 = \{L,R\}$
(so that $S = \{(T,L), (T,R), (B,L), (B,R)\}$), $O = \{o_1,o_2,o_3,o_4\}$,
$f((T,L)) = o_1$, $f((T,R)) = o_2$, $f((B,L)) = o_3$,
                 Player 2                              Player 2
                 L       R                             L       R
Player 1   T    0,3     3,9          Player 1   T     6,3     0,9
           B    3,3     0,0                     B     3,3     3,0

            1 is of type a                       1 is of type b

[Below the two games the figure shows the interactive knowledge-belief structure
over the two states, with the true state marked]

Figure 15.1


In  the  “type-space”  approach  the  situation  illustrated  in  Figure  15.1  would  be 
represented  using  types  of  players:  each  type  of  Player  i represents  a utility 
function of Player i as well as Player i’s beliefs about the types of the other
players.  The  formal  definition  is  as  follows. 


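$f((B,R)) = o_4$. The (state-dependent) utility functions are given by
$U_{1,\alpha} = \begin{pmatrix} o_1 & o_2 & o_3 & o_4 \\ 0 & 3 & 3 & 0 \end{pmatrix}$,
$U_{1,\beta} = \begin{pmatrix} o_1 & o_2 & o_3 & o_4 \\ 6 & 0 & 3 & 3 \end{pmatrix}$ and
$U_{2,\alpha} = U_{2,\beta} = \begin{pmatrix} o_1 & o_2 & o_3 & o_4 \\ 3 & 9 & 3 & 0 \end{pmatrix}$.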
Definition 15.1. A static Bayesian game of incomplete information with
knowledge of one’s own payoffs consists of the following elements:

• a set $I = \{1,\dots,n\}$ of players;

• for every Player $i \in I$, a set $S_i$ of strategies (as usual, we denote by S the set
of strategy profiles);

• for every Player $i \in I$, a set $T_i$ of possible types; we denote by
$T = T_1 \times \dots \times T_n$ the set of profiles of types and by
$T_{-i} = T_1 \times \dots \times T_{i-1} \times T_{i+1} \times \dots \times T_n$ the set of
profiles of types for the players other than i;

• for every Player i and for every type $t_i \in T_i$ of Player i, a utility (or payoff)
function $U_{i,t_i} : S \to \mathbb{R}$;

• for every Player i and for every type $t_i \in T_i$ of Player i, a probability
distribution $P_{i,t_i} : T_{-i} \to [0,1]$ representing the beliefs of type $t_i$ about
the types of the other players.

The beliefs of all the types are said to be Harsanyi consistent if there exists a
common prior, that is, a probability distribution $P : T \to [0,1]$ such that, for
every Player i and for every type $t_i \in T_i$ of Player i, $P_{i,t_i}$ coincides with the
probability distribution obtained from P by conditioning on the event $\{t_i\}$,
that is (denoting a profile $t \in T$ by $(t_{-i}, t_i)$),

$$P_{i,t_i}(t_{-i}) = \frac{P\big((t_{-i},t_i)\big)}{\sum_{t'_{-i} \in T_{-i}} P\big((t'_{-i},t_i)\big)}.$$
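Harsanyi consistency can be tested for a candidate prior by carrying out the conditioning in the definition. The sketch below is illustrative only: the function names, the dictionary encoding of types and beliefs, and the 0-based player indices are assumptions made for the example, and the numbers (a prior of 2/3 and 1/3 over two type profiles) are illustrative values in the spirit of the one-sided example of Figure 15.1.

```python
from fractions import Fraction

def conditional_beliefs(prior, i, t_i):
    """Condition the prior on the event that Player i (0-based index) is of type t_i.

    prior maps type profiles (tuples) to probabilities. The result maps profiles
    of the OTHER players' types to conditional probabilities; None is returned
    when the event has prior probability zero.
    """
    total = sum(pr for t, pr in prior.items() if t[i] == t_i)
    if total == 0:
        return None
    return {t[:i] + t[i + 1:]: pr / total
            for t, pr in prior.items() if t[i] == t_i and pr > 0}

def is_common_prior(prior, beliefs):
    """Check that every type's beliefs equal the prior conditioned on that type.

    beliefs maps (i, t_i) to a dict over profiles of the other players' types.
    """
    for (i, t_i), b in beliefs.items():
        cond = conditional_beliefs(prior, i, t_i)
        support = {k: v for k, v in b.items() if v > 0}
        if cond is None or cond != support:
            return False
    return True

# Two types of Player 1 ("a", "b") and one type of Player 2 ("t2").
prior = {("a", "t2"): Fraction(2, 3), ("b", "t2"): Fraction(1, 3)}
beliefs = {
    (0, "a"): {("t2",): Fraction(1)},
    (0, "b"): {("t2",): Fraction(1)},
    (1, "t2"): {("a",): Fraction(2, 3), ("b",): Fraction(1, 3)},
}

if __name__ == "__main__":
    print(is_common_prior(prior, beliefs))
```

The routine only verifies a given prior; finding one when it is not supplied requires solving the conditioning equations, which this sketch does not attempt.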


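Let us recast the situation illustrated in Figure 15.1 in the terminology of
Definition 15.1. First of all, we have that $I = \{1,2\}$, $S_1 = \{T,B\}$, $S_2 = \{L,R\}$
(so that $S = \{(T,L), (T,R), (B,L), (B,R)\}$). Furthermore there are two types of
Player 1 and only one type of Player 2: $T_1 = \{t_1^a, t_1^b\}$, $T_2 = \{t_2\}$, so that
$T = \{(t_1^a,t_2), (t_1^b,t_2)\}$. The utility functions are given by:

$$U_{1,t_1^a} = \begin{pmatrix} (T,L) & (T,R) & (B,L) & (B,R) \\ 0 & 3 & 3 & 0 \end{pmatrix}, \quad
U_{1,t_1^b} = \begin{pmatrix} (T,L) & (T,R) & (B,L) & (B,R) \\ 6 & 0 & 3 & 3 \end{pmatrix},$$

$$U_{2,t_2} = \begin{pmatrix} (T,L) & (T,R) & (B,L) & (B,R) \\ 3 & 9 & 3 & 0 \end{pmatrix}.$$

The beliefs are given by
$P_{1,t_1^a} = P_{1,t_1^b} = \begin{pmatrix} t_2 \\ 1 \end{pmatrix}$,
$P_{2,t_2} = \begin{pmatrix} t_1^a & t_1^b \\ \frac{2}{3} & \frac{1}{3} \end{pmatrix}$
and the common prior is
$P = \begin{pmatrix} (t_1^a,t_2) & (t_1^b,t_2) \\ \frac{2}{3} & \frac{1}{3} \end{pmatrix}$.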

Before  we  explain  in  detail  how  to  transform  a “state-space”  structure  into  a 
“type-space”  structure  and  vice  versa,  we  give  one  more  example  in  which  there 
is  double-sided  incomplete  information.  Consider  the  situation  illustrated  in 
Figure  15.2,  which  reproduces  Figure  13.8.  The  true  state  is  a. 


Figure  15.2 


In  this  case  we  have  that  each  player  has  two  types:  we  can  identify  a type  of  a 
player  with  a cell  of  the  player’s  information  partition.  Thus  we  have  that 


\A,C)  (. A,D ) C B,C ) (B,D)' 
1 0 0 3 , 


\A,C)  ( A,D ) ( B,C ) (B,D)' 

v 3 i o i y 


'( A,C ) (A,D)  (B,C)  (5,D)3  p 

v 0 2 0 1 J’  « 


and  P 4 

2,q 
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common  prior  is  given  by  P = 


(t:x) 


XA) 

o 


(*l,t 2) 


Thus  the 


two  types  of  Player  1 have  the  same  utility  function  but  different  beliefs  about 
the  types  of  Player  2,  while  the  two  types  of  Player  2 differ  both  in  terms  of 
utility  function  and  in  terms  of  beliefs  about  the  types  of  Player  1. 


From  the  last  example  it  should  be  clear  how  to  transform  a “state-space” 
model  into  a “type-space”  model.  First  of  all,  for  every  Player  i,  create  one  type 
for  every  cell  of  Player  i’s  partition,  making  sure  that  different  cells  are 
associated  with  different  types.  In  this  way  we  have  identified  each  state  with  a 
profile  of  types.  Since  there  is  a probability  distribution  over  each  information 
set  of  Player  i,  that  probability  distribution  will  give  us  a probability 
distribution  for  the  associated  type  of  Player  i over  some  set  of  type-profiles  for 
the  other  players.  Finally,  since  our  hypothesis  (to  be  relaxed  in  the  next 
section)  is  that  each  player  knows  her  own  utility  function  (that  is,  the  utility 
function  of  a player  does  not  vary  from  state  to  state  within  the  same 
information set of that player), with each type $t_i$ of Player i is associated a
unique utility function $U_{i,t_i}$.
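The conversion from a “state-space” model to a “type-space” model described above can be sketched in a few lines. Everything about the encoding is an assumption made for illustration: players are indexed from 0, a type is represented by the index of the corresponding cell, and the function name `state_to_type_space` is hypothetical. Utility functions are left out, since under the hypothesis of this section they are simply copied from any state in the cell.

```python
from fractions import Fraction

def state_to_type_space(partitions, beliefs):
    """Turn a state-space structure into types and type beliefs.

    partitions[i] is the list of cells (frozensets of states) of Player i.
    beliefs[i][cell] maps each state in the cell to its probability.
    Returns (profile, type_beliefs): profile[w] is the tuple of type indices
    at state w, and type_beliefs[(i, k)] is the distribution of type k of
    Player i over the other players' type profiles.
    """
    n = len(partitions)
    states = set().union(*partitions[0])

    def type_index(i, w):
        # One type per cell: the type of Player i at w is the cell containing w.
        for k, cell in enumerate(partitions[i]):
            if w in cell:
                return k
        raise ValueError(f"state {w!r} is not covered by the partition of player {i}")

    profile = {w: tuple(type_index(i, w) for i in range(n)) for w in states}

    type_beliefs = {}
    for i in range(n):
        for k, cell in enumerate(partitions[i]):
            dist = {}
            for w, pr in beliefs[i][cell].items():
                others = profile[w][:i] + profile[w][i + 1:]
                dist[others] = dist.get(others, 0) + pr
            type_beliefs[(i, k)] = dist
    return profile, type_beliefs

# One-sided example: Player 1 knows the state, Player 2 does not.
cells_1 = [frozenset({"alpha"}), frozenset({"beta"})]
cells_2 = [frozenset({"alpha", "beta"})]
example_beliefs = [
    {cells_1[0]: {"alpha": 1}, cells_1[1]: {"beta": 1}},
    {cells_2[0]: {"alpha": Fraction(2, 3), "beta": Fraction(1, 3)}},
]

if __name__ == "__main__":
    print(state_to_type_space([cells_1, cells_2], example_beliefs))
```

In the example Player 1 ends up with two types and Player 2 with one, and the single type of Player 2 inherits the probabilities 2/3 and 1/3 over Player 1's types.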


Conversely, we can convert a “type-space” structure into a “state-space”
structure as follows. Let the set of states be the set T of profiles of types. For
every Player i and for every two states $t, t' \in T$, let t and t' belong to the same
information set of Player i (that is, to the same cell of Player i’s partition) if and
only if Player i’s type is the same in t and t': $t' \in I_i(t)$ if and only if $t_i = t'_i$.
The beliefs of each type of Player i then yield a probability distribution over
the information set of Player i corresponding to that type. An example of this
transformation is given in Exercise 15.2.
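The reverse construction is equally mechanical. The following sketch, with a hypothetical function name and the same illustrative encoding (0-based players, beliefs stored as dictionaries), takes the type sets and the type beliefs and returns states, partitions and beliefs over information sets.

```python
from fractions import Fraction
from itertools import product

def type_to_state_space(type_sets, type_beliefs):
    """Build a state-space structure from a type-space structure.

    type_sets[i] lists the types of Player i. type_beliefs[(i, t_i)] maps
    profiles of the other players' types to probabilities. States are the
    profiles of types; two states are in the same cell of Player i exactly
    when Player i's type is the same in both.
    """
    states = list(product(*type_sets))
    partitions, beliefs = [], []
    for i, types_i in enumerate(type_sets):
        cells, cell_beliefs = [], {}
        for t_i in types_i:
            cell = frozenset(t for t in states if t[i] == t_i)
            cells.append(cell)
            # The belief of type t_i becomes a distribution over its own cell.
            cell_beliefs[cell] = {
                t: type_beliefs[(i, t_i)].get(t[:i] + t[i + 1:], 0) for t in cell
            }
        partitions.append(cells)
        beliefs.append(cell_beliefs)
    return states, partitions, beliefs

# Two types of Player 1 and one type of Player 2 (illustrative numbers).
example_types = [["a", "b"], ["t2"]]
example_type_beliefs = {
    (0, "a"): {("t2",): 1},
    (0, "b"): {("t2",): 1},
    (1, "t2"): {("a",): Fraction(2, 3), ("b",): Fraction(1, 3)},
}

if __name__ == "__main__":
    print(type_to_state_space(example_types, example_type_beliefs))
```

Type profiles to which a type assigns no probability are kept as states with probability 0 in that type's cell, which matches taking the whole set T as the set of states.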


This is a good time to test your understanding of the concepts introduced
in this section, by going through the exercises in Section 15.E.2 of Appendix
15.E at the end of this chapter.
15.3 The general case

As  pointed  out  in  Remark  13.1  (Chapter  13),  it  may  very  well  be  the  case  that  a 
rational  player  does  not  know  her  own  payoffs  (for  example,  because  she  is 
uncertain  about  what  outcomes  might  occur  if  she  chooses  a particular  action). 
Uncertainty  about  a player’s  own  payoffs  is  compatible  with  the  player 
knowing  her  own  preferences  (that  is,  how  she  ranks  the  outcomes  that  she 
considers  possible).  Definition  15.1  is  not  general  enough  to  encompass  such 
possibilities.  The  following,  more  general,  definition  allows  the  utility  function 
of  a player  to  depend  not  only  on  the  player’s  own  type  but  also  on  the  types 
of  the  other  players.  Definition  15.2  is  identical  to  Definition  15.1  except  for 
the  fourth  and  fifth  bulleted  items,  which  have  been  highlighted  in  bold  red 
characters,  and  the  additional  part  in  the  last  bulleted  item,  which  is  also 
highlighted  in  bold  red  characters. 

Definition 15.2. A static Bayesian game of incomplete information consists of
the following elements:

• a set $I = \{1,\dots,n\}$ of players;

• for every Player $i \in I$, a set $S_i$ of strategies (as usual we denote by S the set
of strategy profiles);

• for every Player $i \in I$, a set $T_i$ of possible types; we denote by
$T = T_1 \times \dots \times T_n$ the set of profiles of types and by
$T_{-i} = T_1 \times \dots \times T_{i-1} \times T_{i+1} \times \dots \times T_n$ the set of
profiles of types for the players other than i;

• a set $Y \subseteq T$ of relevant profiles of types;

• for every Player i and for every profile of types $t \in Y$, a utility (or
payoff) function $U_{i,t} : S \to \mathbb{R}$;

• for every Player i and for every type $t_i \in T_i$ of Player i, a probability
distribution $P_{i,t_i} : T_{-i} \to [0,1]$ representing the beliefs of type $t_i$ about
the types of the other players, satisfying the restriction that if
$P_{i,t_i}(t_{-i}) > 0$ then $(t_i, t_{-i}) \in Y$.

As  in  the  special  case  considered  in  the  previous  section,  also  in  the  general  case 
one  can  transform  a “state-space”  structure  into  a “type-space”  structure  and 
vice  versa. 
Given a “state-space” structure, for every Player i we identify the cells of Player
i’s partition with the types of Player i (one type for every information set).
Since there is a probability distribution over each information set of Player i,
that probability distribution will yield the probability distribution for the
associated type of Player i over some set of type-profiles for the other players.
Finally, having identified each state with a profile of types (since each state
belongs to one and only one information set of each player), we can assign to
the corresponding type of Player i the utility function of Player i at that state.
We shall illustrate this conversion using the “state-space” structure shown in
Figure 15.3 below (taken from Exercise 13.2). The corresponding “type-space”
structure (static Bayesian game of incomplete information) is the following:
$I = \{1,2\}$ (letting Bill be Player 1 and Ann Player 2), $S_1 = \{g, ng\}$,
$S_2 = \{a, r\}$ (so that $S = \{(g,a), (g,r), (ng,a), (ng,r)\}$). Furthermore there are
two types of Player 1 and only one type of Player 2: $T_1 = \{t_1^f, t_1^e\}$
(f stands for ‘friend’ and e for ‘enemy’), $T_2 = \{t_2\}$, so that
$T = \{(t_1^f,t_2), (t_1^e,t_2)\}$. In this case we have that Y = T.
The utility functions are given by:


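$$U_{1,(t_1^f,t_2)} = U_{1,(t_1^e,t_2)} = \begin{pmatrix} (g,a) & (g,r) & (ng,a) & (ng,r) \\ 1 & -1 & 0 & 0 \end{pmatrix},$$

$$U_{2,(t_1^f,t_2)} = \begin{pmatrix} (g,a) & (g,r) & (ng,a) & (ng,r) \\ 1 & 0 & 0 & 0 \end{pmatrix}, \quad
U_{2,(t_1^e,t_2)} = \begin{pmatrix} (g,a) & (g,r) & (ng,a) & (ng,r) \\ -1 & 0 & 0 & 0 \end{pmatrix}.$$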


p , = 

ro 

p = 

G7 

ll 

te  ^ 
ii 

1,?1 

10 
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Ip 
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ng 
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© ® 


2: 


true  state 

Figure  15.3 

Conversely, we can convert a “type-space” structure into a “state-space”
structure as follows. Let the set of states be the set Y of relevant profiles of
types. For every Player i and for every two states $t, t' \in Y$, let t and t' belong
to the same information set of Player i (that is, to the same cell of Player i’s
partition) if and only if Player i’s type is the same in t and t': $t' \in I_i(t)$ if and
only if $t_i = t'_i$. The beliefs of each type of Player i then yield a probability
distribution over the information set of Player i corresponding to that type.
An example of this transformation is given in Exercise 15.3.

This is a good time to test your understanding of the concepts introduced
in this section, by going through the exercises in Section 15.E.3 of Appendix
15.E at the end of this chapter.
Appendix 15.E: Exercises

15.E.2. Exercises for Section 15.2: Types that know their own payoffs

The  answers  to  the  following  exercises  are  in  Appendix  S. 

Exercise  15.1.  Transform  the  following  situation  of  incomplete  information 
into  a “type-space”  structure. 

game  game  game  game 

Gj  G2  Gj  G2 


H 

Pi  i y ' 

0 

a 1 

P f y 1 

i5 

3: 


Player  2 

C D 

A 

Player  1 B 

4 

1 

0 

2 

4 

4 

1 

0 

0 

0 

Player  3 chooses  E 

Player  2 

C D 

A 

Player  1 B 

2 

0 

0 

1 

0 

0 

0 

0 

2 

Player  3 chooses  F 

GAME  G, 

Player  2 

C D 

A 

Player  1 B 

4 

4 

2 

1 

4 

4 

0 

1 

Player  3 chooses  E 

Player  2 

C D 

A 

Player  1 B 

2 

1 

0 

0 

0 

2 

0 

0 

Player  3 chooses  F 

GAME  G2 
Exercise 15.2. Consider the following Bayesian game of incomplete
information:

$I = \{1,2\}$, $S_1 = \{A,B\}$, $S_2 = \{C,D\}$, $T_1 = \{t_1^a, t_1^b\}$, $T_2 = \{t_2^a, t_2^b, t_2^c\}$


= 


AC  AD  BC  BD 


1 


0 


^ = 


AC  AD  BC  BD 


0 0 


1 


c ..  =u. , = 


2 ,ti 


fAC 

AD 

BC 

BD ^ 

, uc  = 

fAC 

AD 

BC  BD\ 

v 2 

1 

2 

3 J 

2,4 

V 0 

2 

2 0 j 

P a = 

U“ 


P c = 


( +b  fc\ 

L L ^ ^ 

i 2 0 

V 4 4 U 


yfl  jb  \ 

' 1 H 

0 1 


^ = 


C *2 


4_  _3_ 

V 13  13 


_6 

i3  y 


P2/‘  = 


t t 
L\ 


I 
V 3 


b\ 


2 

3 y 


p - 

2,4 


m h 


2 

V 3 


1 

3 J 


(a)  Are  the  beliefs  of  the  types  consistent  (that  is,  is  there  a common  prior)? 

(b)  Transform  this  type-space  structure  into  an  interactive  knowledge-belief 
structure. 


15.E.3. Exercises for Section 15.3: The general case

The  answers  to  the  following  exercises  are  in  Appendix  S at  the  end  of  this 
chapter. 

Exercise  15.3.  Consider  the  following  Bayesian  game  of  incomplete 
information. 


I = {1,2,3} , S,=\A.B\,Sl=\C.D\,S,=>F.F\,  Tt  ={r,‘,(f},  T2={f24}, 


U, 


( ACE  ADE  BCE  BDE  ACE  ADF  BCE  BDF 


I 2 


0 


0 


0 


1 


1 


44) ' 


ACE  ADE  BCE  BDE  ACE  ADE  BCE  BDE 
12  0 0 10  0 1 


U, 


h(h  A ’H  ) 


ACE  ADE  BCE  BDE  ACF  ADF  BCF  BDF 


0 


0 


1 


0 


0 


1 
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(ACE  ADE  BCE  BDE  ACE  ADF  BCF  BDF " 
2’(G«)_[  2 4 2 0 0 1 0 2 , 

(ACE  ADE  BCE  BDE  ACF  ADF  BCF  BDF  \ 

4 1 4 1 1 0 2 0 J 

( ACE  ADE  BCE  BDE  ACF  ADF  BCF  BDF  \ 

U2,(tlti4)~y  4 3 1 2 0 1 0 2 J 

(ACE  ADE  BCE  BDE  ACF  ADF  BCF  BDF  \ 

UUtC<°i4)-y  02002000J 

(ACE  ADE  BCE  BDE  ACF  ADF  BCF  BDF  \ 

U3,(tC‘b2,‘D~UH>C44)~[  3 2 0 1 1 2 2 0 J 


(a)  Are  the  beliefs  of  the  types  consistent  (that  is,  is  there  a common  prior)? 

(b)  Transform  this  type-space  structure  into  an  interactive  knowledge-belief 
structure. 


Exercise  15.4:  Challenging  Question. 

Consider  the  following  Bayesian  game  of  incomplete  information: 

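$I = \{1,2\}$, $S_1 = \{T,B\}$, $S_2 = \{L,R\}$, $T_1 = \{t_1^A, t_1^B\}$, $T_2 = \{t_2^a, t_2^b\}$,

$$U_{1,(t_1^A,t_2^a)} = U_{1,(t_1^A,t_2^b)} = \begin{pmatrix} (T,L) & (T,R) & (B,L) & (B,R) \\ 6 & 0 & 3 & 3 \end{pmatrix},$$

$$U_{1,(t_1^B,t_2^a)} = U_{1,(t_1^B,t_2^b)} = \begin{pmatrix} (T,L) & (T,R) & (B,L) & (B,R) \\ 0 & 3 & 3 & 0 \end{pmatrix},$$

$$U_{2,(t_1^A,t_2^a)} = U_{2,(t_1^A,t_2^b)} = U_{2,(t_1^B,t_2^a)} = U_{2,(t_1^B,t_2^b)} = \begin{pmatrix} (T,L) & (T,R) & (B,L) & (B,R) \\ 3 & 9 & 3 & 0 \end{pmatrix},$$

$$P_{1,t_1^A} = \begin{pmatrix} t_2^a & t_2^b \\ \frac{1}{2} & \frac{1}{2} \end{pmatrix}, \quad
P_{1,t_1^B} = \begin{pmatrix} t_2^a & t_2^b \\ \frac{3}{5} & \frac{2}{5} \end{pmatrix}, \quad
P_{2,t_2^a} = \begin{pmatrix} t_1^A & t_1^B \\ \frac{1}{4} & \frac{3}{4} \end{pmatrix} \quad \text{and} \quad
P_{2,t_2^b} = \begin{pmatrix} t_1^A & t_1^B \\ \frac{1}{3} & \frac{2}{3} \end{pmatrix}.$$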

(a) Transform the situation described above into an interactive knowledge-belief
structure. Assume that the true state is where Player 1 is of type $t_1^A$
and Player 2 is of type $t_2^a$.


(b)  Apply  the  Harsanyi  transformation  to  obtain  an  extensive-form  game. 

(c)  Write  the  strategic-form  game  corresponding  to  the  game  of  part  (b). 

(d)  Find  all  the  pure-strategy  Bayesian  Nash  equilibria.  Does  any  of  these  yield 
a Nash  equilibrium  in  the  true  game  being  played? 

(e)  Of  the  pure-strategy  Bayesian  Nash  equilibria  select  one  where  Player  1 is 
uncertain  about  Player  2’s  choice  of  action.  For  this  equilibrium  complete 
the  structure  of  part  (a)  by  turning  it  into  a model  (that  is,  associate  with 
each  state  an  action  - not  a strategy  - for  each  player)  and  verify  that  at  the 
true  state  there  is  common  knowledge  of  rationality. 
Appendix 15.S: Solutions to exercises


Exercise 15.1. The structure is as follows (the elements are given as listed in
Definition 15.1). $I = \{1,2,3\}$, $S_1 = \{A,B\}$, $S_2 = \{C,D\}$, $S_3 = \{E,F\}$,
$T_1 = \{t_1^a, t_1^b, t_1^c\}$, $T_2 = \{t_2^a, t_2^b\}$, $T_3 = \{t_3^a, t_3^b\}$,


17  =U  b=U  = 

l,r,  l,tf  i,<f 


/l  CE  ADE  BCE  BDE  ACF  ADF  BCE  BDF 


0 


0 


0 


0 


U»S  = 


= 


' ACE  ADE  BCE  BDE  ACF  ADF  BCF  BDF 
14  10  0 10  2 


ACE  ADE  BCE  BDE  ACF  ADF  BCF  BDF 
4 14  110  2 0 


U,<  =U*  = 


' ACE  ADE  BCE  BDE  ACF  ADF  BCF  BDF 
^02002000 


P . = 


'(«) 

V 1 2 

f 


, P,= 


(is,)  (<2v,T 


P . = 

2 A 


P . = 

3 .<? 


V 

b ,b  D 


P c = 


(<;.<?)  (4,4) 

1 1 

9 9 

(«) 


TiT)  (T>7) 


\ 9 


2 

9 y 


P = 

2/, 


P,A  = 


'{iAf 

1 


2 

V 3 

f l J) 


(4,4)  (<■4)'' 


2_ 

V 3 


The  common  prior  is  given  by 
P = 


f j.a -a iCiiaA)  .a  A)  .a  .aJj^b  J)^aJ)  AA-a  A A J)  jC^a^a  .c -a  A j.cAu.a  j.cAA\ 

^1  H l2  f2  f3  ^1  *2*3  ^1  *2^3  H ^2^3  ^1  *2^3  ^1  ^2*3  ^1  *2*3  ^1  ^2^3  D ^2^3  ^1  *2*3 

v^0000^^0000-^ 


12  y 


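Exercise 15.2. (a) Yes, the following is a common prior:

$$P = \begin{pmatrix} (t_1^a,t_2^a) & (t_1^a,t_2^b) & (t_1^a,t_2^c) & (t_1^b,t_2^a) & (t_1^b,t_2^b) & (t_1^b,t_2^c) \\ \frac{2}{21} & \frac{6}{21} & 0 & \frac{4}{21} & \frac{3}{21} & \frac{6}{21} \end{pmatrix}$$

(b) The knowledge-belief structure is as follows: the set of states is
$\Omega = \{\alpha, \beta, \gamma, \delta, \varepsilon\}$, where $\alpha = (t_1^a,t_2^a)$,
$\beta = (t_1^a,t_2^b)$, $\gamma = (t_1^b,t_2^a)$, $\delta = (t_1^b,t_2^b)$,
$\varepsilon = (t_1^b,t_2^c)$.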
[Figure: the game associated with each state (Player 1 chooses between A and B,
Player 2 between C and D), together with the players' information partitions
and beliefs over the five states.]

Exercise 15.3. (a) Yes: the following is a common prior:

[Array $P$ assigning probabilities to the type profiles in its support; entries
not legible in this copy.]

and $P(t) = 0$ for every other $t \in T$.


(b) First define the following games:

[Three-player games, each shown as two tables: Player 1 chooses the row (A or B),
Player 2 chooses the column (C or D) and Player 3 chooses the table (E or F).
Payoff entries not legible in this copy.]
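Then, letting $\alpha$, $\beta$, $\gamma$ and $\delta$ denote the four states (type profiles), with
$\delta = (t_1^b,t_2^b,t_3^b)$, the interactive knowledge-belief structure is as follows:

[Figure: the game associated with each of the four states, together with the
players' information partitions and beliefs.]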


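Exercise 15.4 (Challenging Question). (a) The interactive knowledge-belief
structure is as follows, where $\alpha = (t_1^A,t_2^a)$, $\beta = (t_1^A,t_2^b)$, $\gamma = (t_1^B,t_2^a)$,
$\delta = (t_1^B,t_2^b)$.

Game at states $\alpha$ (types: 1A & 2a) and $\beta$ (types: 1A & 2b):

                      Player 2
                      L        R
   Player 1    T      6, 3     0, 9
               B      3, 3     3, 0

Game at states $\gamma$ (types: 1B & 2a) and $\delta$ (types: 1B & 2b):

                      Player 2
                      L        R
   Player 1    T      0, 3     3, 9
               B      3, 3     0, 0

Player 1's beliefs: type 1A assigns probability 1/2 to $\alpha$ and 1/2 to $\beta$;
type 1B assigns probability 3/5 to $\gamma$ and 2/5 to $\delta$.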

game  is  as  follows: 
(c) The strategy profiles give rise to the following lotteries. Each lottery
assigns probabilities 1/7, 1/7, 3/7 and 2/7 to the outcomes at states $\alpha$, $\beta$, $\gamma$
and $\delta$, respectively. A strategy of Player 1 is written XY, meaning X if type 1A
and Y if type 1B; a strategy of Player 2 is written XY, meaning X if type 2a and
Y if type 2b.

   Strategy    Outcome at                Player 1's payoff     Player 2's payoff
   profile     α, β, γ, δ                at α, β, γ, δ         at α, β, γ, δ

   (TT, LL)    TL, TL, TL, TL            6, 6, 0, 0            3, 3, 3, 3
   (TT, LR)    TL, TR, TL, TR            6, 0, 0, 3            3, 9, 3, 9
   (TT, RL)    TR, TL, TR, TL            0, 6, 3, 0            9, 3, 9, 3
   (TT, RR)    TR, TR, TR, TR            0, 0, 3, 3            9, 9, 9, 9

   (TB, LL)    TL, TL, BL, BL            6, 6, 3, 3            3, 3, 3, 3
   (TB, LR)    TL, TR, BL, BR            6, 0, 3, 0            3, 9, 3, 0
   (TB, RL)    TR, TL, BR, BL            0, 6, 0, 3            9, 3, 0, 3
   (TB, RR)    TR, TR, BR, BR            0, 0, 0, 0            9, 9, 0, 0

   (BT, LL)    BL, BL, TL, TL            3, 3, 0, 0            3, 3, 3, 3
   (BT, LR)    BL, BR, TL, TR            3, 3, 0, 3            3, 0, 3, 9
   (BT, RL)    BR, BL, TR, TL            3, 3, 3, 0            0, 3, 9, 3
   (BT, RR)    BR, BR, TR, TR            3, 3, 3, 3            0, 0, 9, 9

   (BB, LL)    BL, BL, BL, BL            3, 3, 3, 3            3, 3, 3, 3
   (BB, LR)    BL, BR, BL, BR            3, 3, 3, 0            3, 0, 3, 0
   (BB, RL)    BR, BL, BR, BL            3, 3, 0, 3            0, 3, 0, 3
   (BB, RR)    BR, BR, BR, BR            3, 3, 0, 0            0, 0, 0, 0

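Thus the strategic form is as follows (each cell gives the expected payoffs of
the corresponding lottery, Player 1's first):

                                                        Player 2
                                       L if type 2a   L if type 2a   R if type 2a   R if type 2a
                                       L if type 2b   R if type 2b   L if type 2b   R if type 2b

            T if type 1A, T if type 1B   12/7, 3        12/7, 39/7     15/7, 45/7     15/7, 9
   Player   T if type 1A, B if type 1B   27/7, 3        15/7, 3        12/7, 18/7     0, 18/7
     1      B if type 1A, T if type 1B   6/7, 3         12/7, 30/7     15/7, 36/7     3, 45/7
            B if type 1A, B if type 1B   3, 3           15/7, 12/7     12/7, 9/7      6/7, 0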
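The entries of the strategic form are the expected values of the lotteries in part (c). The following is a minimal Python sketch of that computation, assuming the two payoff tables of part (a) and the probabilities 1/7, 1/7, 3/7, 2/7 on the four states. The names GAME, PRIOR, expected_payoffs and strategic_form are illustrative and do not come from the text.

```python
from fractions import Fraction
from itertools import product

# Payoffs (Player 1, Player 2) of the game played when Player 1 is of
# type 1A (states alpha, beta) and of type 1B (states gamma, delta).
GAME = {
    "A": {("T", "L"): (6, 3), ("T", "R"): (0, 9),
          ("B", "L"): (3, 3), ("B", "R"): (3, 0)},
    "B": {("T", "L"): (0, 3), ("T", "R"): (3, 9),
          ("B", "L"): (3, 3), ("B", "R"): (0, 0)},
}

# Probabilities of the states, each written as
# (type of Player 1, type of Player 2), as used in the lotteries of part (c).
PRIOR = {
    ("A", "a"): Fraction(1, 7), ("A", "b"): Fraction(1, 7),
    ("B", "a"): Fraction(3, 7), ("B", "b"): Fraction(2, 7),
}


def expected_payoffs(s1, s2):
    """Expected payoffs when s1 maps Player 1's types to actions
    and s2 maps Player 2's types to actions."""
    u1 = u2 = Fraction(0)
    for (t1, t2), prob in PRIOR.items():
        x, y = GAME[t1][(s1[t1], s2[t2])]
        u1 += prob * x
        u2 += prob * y
    return u1, u2


def strategic_form():
    """Return {(row, column): (u1, u2)} for all 16 pure-strategy profiles."""
    table = {}
    for a1, b1, a2, b2 in product("TB", "TB", "LR", "LR"):
        s1 = {"A": a1, "B": b1}
        s2 = {"a": a2, "b": b2}
        table[(a1 + b1, a2 + b2)] = expected_payoffs(s1, s2)
    return table


for (row, col), (u1, u2) in strategic_form().items():
    print(f"({row}, {col}): {u1}, {u2}")
```

Exact fractions are used instead of floating-point numbers so that ties between payoffs, which matter for the equilibrium analysis, are detected exactly.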
(d) The pure-strategy Nash equilibria are: (TB,LL), (TB,LR) and (BT,RR). None
of them yields a Nash equilibrium of the game associated with state $\alpha$, since
neither (T,L) nor (B,R) is a Nash equilibrium of that game.
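The list of equilibria in (d) can be checked by brute force: a cell is a pure-strategy Nash equilibrium when neither player can gain by a unilateral deviation. The sketch below is one way to run that check in Python. The payoff table it uses is an assumption of the sketch: it was obtained here by taking expected values of the lotteries in part (c), and the names ROWS, COLS, PAYOFF and pure_nash are illustrative.

```python
from fractions import Fraction as F

ROWS = ["TT", "TB", "BT", "BB"]  # Player 1: action if type 1A, action if type 1B
COLS = ["LL", "LR", "RL", "RR"]  # Player 2: action if type 2a, action if type 2b

# Expected payoffs (Player 1, Player 2); one list per row, ordered as COLS.
PAYOFF = {
    "TT": [(F(12, 7), F(3)), (F(12, 7), F(39, 7)), (F(15, 7), F(45, 7)), (F(15, 7), F(9))],
    "TB": [(F(27, 7), F(3)), (F(15, 7), F(3)), (F(12, 7), F(18, 7)), (F(0), F(18, 7))],
    "BT": [(F(6, 7), F(3)), (F(12, 7), F(30, 7)), (F(15, 7), F(36, 7)), (F(3), F(45, 7))],
    "BB": [(F(3), F(3)), (F(15, 7), F(12, 7)), (F(12, 7), F(9, 7)), (F(6, 7), F(0))],
}


def pure_nash(payoff):
    """Return the cells where each strategy is a best reply to the other."""
    equilibria = []
    for row in ROWS:
        for j, col in enumerate(COLS):
            u1, u2 = payoff[row][j]
            # Player 1 cannot do better by switching rows in column j.
            row_is_best = all(payoff[other][j][0] <= u1 for other in ROWS)
            # Player 2 cannot do better by switching columns in this row.
            col_is_best = all(payoff[row][k][1] <= u2 for k in range(len(COLS)))
            if row_is_best and col_is_best:
                equilibria.append((row, col))
    return equilibria


print(pure_nash(PAYOFF))
```

Best replies are tested with weak inequalities, so a strategy that ties with another one still counts as a best reply; this is what makes (TB,LR) an equilibrium.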

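(e) The only pure-strategy equilibrium where Player 1 is uncertain of Player 2’s
choice is (TB,LR). The corresponding model is:

   State                  $\alpha$          $\beta$          $\gamma$          $\delta$
   Types                  1A & 2a    1A & 2b    1B & 2a    1B & 2b
   Player 1's action      T          T          B          B
   Player 2's action      L          R          L          R

The true state is $\alpha$. The games and beliefs are those of part (a).

Game at states $\alpha$ and $\beta$:

                      Player 2
                      L        R
   Player 1    T      6, 3     0, 9
               B      3, 3     3, 0

Game at states $\gamma$ and $\delta$:

                      Player 2
                      L        R
   Player 1    T      0, 3     3, 9
               B      3, 3     0, 0

Player 1's beliefs: probability 1/2 on each of $\alpha$ and $\beta$ when of type 1A.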


There is common knowledge of rationality because at every state both players are
rational. In fact, at states $\alpha$ and $\beta$ Player 1 has an expected payoff of 3 from both T
and B (thus T is a best reply) and at states $\gamma$ and $\delta$ Player 1 has an expected payoff of
$\frac{9}{5}$ from B and $\frac{6}{5}$ from T (thus B is a best reply). At states $\alpha$ and $\gamma$ Player 2 has an
expected payoff of 3 from L and $\frac{9}{4}$ from R (thus L is a best reply) and at states $\beta$ and
$\delta$ Player 2 has an expected payoff of 3 from both L and R (thus R is a best reply).
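The expected payoffs quoted in the rationality argument can be recomputed type by type. The Python sketch below does so under two assumptions that are labeled in the code: each type's beliefs are obtained by conditioning the probabilities 1/7, 1/7, 3/7, 2/7 of part (c) on the player's own type, and the actions are those of the equilibrium (TB,LR). The names GAME, PRIOR, SIGMA1, SIGMA2, beliefs, expected and is_rational are illustrative.

```python
from fractions import Fraction

# Payoffs (Player 1, Player 2), indexed by Player 1's type.
GAME = {
    "A": {("T", "L"): (6, 3), ("T", "R"): (0, 9),
          ("B", "L"): (3, 3), ("B", "R"): (3, 0)},
    "B": {("T", "L"): (0, 3), ("T", "R"): (3, 9),
          ("B", "L"): (3, 3), ("B", "R"): (0, 0)},
}

# States as (type of Player 1, type of Player 2) with their probabilities.
PRIOR = {
    ("A", "a"): Fraction(1, 7), ("A", "b"): Fraction(1, 7),
    ("B", "a"): Fraction(3, 7), ("B", "b"): Fraction(2, 7),
}

# Assumption: the model built from the equilibrium (TB, LR).
SIGMA1 = {"A": "T", "B": "B"}  # Player 1's action for each of his types
SIGMA2 = {"a": "L", "b": "R"}  # Player 2's action for each of her types


def beliefs(player, own_type):
    """Assumption: beliefs come from conditioning PRIOR on the own type."""
    idx = 0 if player == 1 else 1
    cell = {s: p for s, p in PRIOR.items() if s[idx] == own_type}
    total = sum(cell.values())
    return {s: p / total for s, p in cell.items()}


def expected(player, own_type, action):
    """Expected payoff of `action`, given the opponent's state-by-state action."""
    value = Fraction(0)
    for (t1, t2), prob in beliefs(player, own_type).items():
        if player == 1:
            value += prob * GAME[t1][(action, SIGMA2[t2])][0]
        else:
            value += prob * GAME[t1][(SIGMA1[t1], action)][1]
    return value


def is_rational(player, own_type):
    """True if the action prescribed by the model is a best reply."""
    actions = "TB" if player == 1 else "LR"
    chosen = (SIGMA1 if player == 1 else SIGMA2)[own_type]
    best = max(expected(player, own_type, a) for a in actions)
    return expected(player, own_type, chosen) == best


for player, types, actions in [(1, "AB", "TB"), (2, "ab", "LR")]:
    for t in types:
        values = {a: str(expected(player, t, a)) for a in actions}
        print(player, t, values, is_rational(player, t))
```

Since every type of every player is rational in this model, rationality holds at every state, which is what the common-knowledge claim relies on.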